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To Print or Not To Print 


Here is yet another bumper issue of AUUGN. Yes, I know its a bit late, but I 
have been having my house renovated, and that comes before this. 

Due to financial limitations I have had to adopt some fairly strict control 
over xdiat has gone into this issue and reserve a lot of material for inclusion 
in latter issues. Tie editorial committee (an ad hoc group of people 
including John Lions and Kevin Hill) have been asked to comment on what should 
be included, but unfortunately considerations such as postal weight have 
forced a number of items into the list below. This list gives a brief summary 
of the stuff I have on hand, available on request, which may be included in 
future AUUGNs. 

Various advertisements from the last US meeting offering UNIX 
or UNIX-like software and/or documentation. 

"Programming in QED: A Tutorial" plus a Qed manual entry 

Writeup on the Georgia Tech 'Software Tools' Subsystem 

An invitation to a US general access UNIX network 

Quite a large manual entry for the Tilbrook Information System 

A list of names and affiliations of the 450 or so attendees at 
the TJS meeting 

"Macros for Analyzing C Program Arguments" by John Lions 

A transcription from a tape that Ian Johnstone sent us about 
the US software tools and UNIX meetings (suitably edited). 

A study of the way UNIX boots itself into operation. 

Most of the normaT correspondence. 
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You may also notice a slight change in the page length of this issue. People, 
particularly in the USA, have pointed out that it is very difficult to copy 
Australian 'A4' pages on to smaller pages used as standard in other countries. 
All contributors please note the need for more white space. 


US User Meeting Summary 


Included in this issue is a rather detailed review of the last US meeting. 


Overseas Arrangements 


This issue contains material from the last UKUUGN which I received recently as 
part of the exchange agreement with the UK users group. I have written, 
formally requesting similar agreements to the two US users groups plus the 
smaller (no newsletter etc.) Canadian group. I hope to have exchange 
arrangements agreed upon by late May and obtain material for the next 
newsletter. 


Still More About Vaxes 


A VAX11/780 will be delivered to the School of Electrical Engineering at the 
University of New South Wales late in June 1980. It will be used for teaching 
in the Department of Computer Science, to relieve the massive overloading 
experienced by the Department over the last year or more. 

For those of you interested in the Berkeley VAX software, this newsletter has 
four papers on design, implementation, experience and evaluation of this 
software. Should these papers stir you* to purchase VMUNIX, I have also 
enclosed a license agreement. 


International Meeting 


The International UNIX meeting has been planned for 18th / 19th of October. 
Robert Elz has written detailing progress and imploring users, both local and 
overseas to reply to the announcement even if they are not comming or do not 
know for certain. His letter appears latter in this issue. Considering that 
eight of the fifteen replies so far received have been from the UK and the 
USA, local group members are showing disproportionate apathy. 
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Dial-in Facilities 


There are a few dial-in lines available to our readers at the Uni of New South 
Wales, and Sydney Uni* The Australian Graduate School of Management, where 
the local software data-base is maintained allows visitors to login on 
(02)662-1666 for read-only access to software. Mail 'kev' on that machine if 
you cant find something. 

The Elec Eng machine allows access on (02)662-1733. Mail to the AUUGN editor 
may be sent to 'auugn' and some other noteable persons have accounts on this 
machine including John Lions ('johnl'). 

The Basser Dept of Computer Science at Sydney Uni, on (02)660-5772, is the 
entry to a small subsection of the local network. Mailing on this machine it 
is possible to contact: 

piers Piers Diclc-Lauder 
chris Chris Maltby 

chrisr Chris Rowles (large mini UNIX person) 

The login name for all the above numbers is 'visitor'• 


The Software Catalogue 


To date I have received a grand total of four (4) replies, pledging between 
six and seven hundred dollars towards the project. At the last meeting I 
counted at least a dozen people who said their site would be prepared to 
contribute a substantial amount of money. 

I am amazed by the apathy of you readers out there. A site in ISRAEL who had 
not even subscribed until this issue took the trouble to obtain a software 
catalogue form and offer to make a donation. Yet all you locals could only 
manage three replies! ill* 

I know that students work for peanuts, but at least chip in to by the paper 
bag to put the food in!!! 


Site Surveys 


Well considering there are only four sites in our whole readership I wonder 
how the rest of you do any computing. Only four replies, not the same four as 
above, from a total readership of sixty-five. Honestly, I feel that if I 
enclosed a postage payed envelope to return forms in, you still would not fill 
them in and post them. 

I have included a site survey form from the UK group in the hope that a second 
(non-manufacturer-biased) attempt may get a better response. Unless some more 
enthusiasm is shown I shall not even bother to send forms out (be they anti 
Perkin-Elmer or not - thanks for your reply Juris). 
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Peter Ivanov 

Dept, of Computer Science 
Electrical Engineering 
PO Box 1 
Kensington 2033 
AUSTRALIA 

(02) 662-3781 
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DEPARTMENT OF COMPUTER SCIENCE 


Parkville, Victoria 3052 


12th April, 1980 


Peter Ivanov, 

Dept, of Computer Science, 
Electrical Engineering, 
University of New South Wales, 
Kensington. 


Dear Peter, 

As I mentioned on the 'phone the other day, the 
International Unix User's Group meeting planned for Mel¬ 
bourne later this year will be held on the weekend of the 
18th / 19th of October. These dates were the most preferred 
of those that were proposed, in fact there has been no 
request for any of the other dates at all. 


Times, venue, and program will be announced 
later, when I have a better idea of the number of attendees. 
As yet there have been very few replies to the request for 
people to let me know whether they intend coming or not. I 
would appreciate it if you could request the readers of 
AUUGN, (both local and overseas) who have not yet so indi¬ 
cated, to do so as soon as possible. Please emphasise that I 
would prefer a note indicating that the sender is not com¬ 
ing, or is uncertain to no reply at all, as I will then be 
able to form a more accurate estimate of the size of the 
meeting. 

In order that you may be able to induce some 
replies, I am enclosing a list of those who have replied, 
others may be shamed into doing likewise. 

The availability of accomodation at reasonable 
prices for interstate and overseas guests is currently being 
investigated. The prospects in this area have brightened 
slightly since our earlier conversation, 1 will send more 
details when available. Anyone who would be interested in 


AUUGN 


5 








such accomodation (either just for the UUG meeting, or 
stretching backwards into the previous week for IFIP) should 
NOT wait for such further information to come via the 
newsletter, but should contact me as soon as possible. 

On an entirely unrelated matter, prehaps either 
you or one (or more) of the readers of AUUGM could provide a 
solution to the following problem, which' is probably a UNIX 
bug, or if not, I would like to know where I have missed 
something. 


If an 'open' or 'close' routine of a device 
driver sleeps at low priority ( > PZERO ), and gets inter¬ 
rupted by a signal to the calling process, strange things 
happen. If an open routine is interrupted, UNIX regards the 
device as open, but returns error (EINTR) status to the pro¬ 
cess, which must assume that the open failed. (A C program 
will find it difficult to obtain the file descriptor in any 
case). If a close routine is interrupted, UNIX regards the 
device as closed, depriving the driver of the chance to 
clean up correctly (the process will almost certainly ignore 
the error indication here, and will regard the file as 
closed). In general no insurmountable problems occur, but 
the combination of a single use device (like a line printer) 
and a process, that tries to be intelligent about its use of 
that device (ie: doesn't just give up and exit when an open 
fails) and which uses signals to receive messages from other 
processes can cause difficulties. 

If anyone has a solution to this, or can suggest 
a suitable way to avoid the problem (especially allowing the 
close routine to tidy up and release resources) I would be 
grateful if they would let me know. 


Thanks for your help in publicising the Interna¬ 
tional Unix Users Group meeting. 


Yours 



Robert Elz. 






Replies to the request for information as to who will be attending the 
International Unix Users Group meeting in Melbourne in October have been 
received from: 


Piers 

Lauder, 

Comp Sci, Sydney Uni 

coming 

I. R. 

Perry, 

L.E.R.S.-Synthelabo, Paris, France 

coming 

Roger 

Miller, 

Architecture, UNSW 

coming 

J. Reinfelds, 

Comp Sci, Wollongong Uni 

coming 

Geoff 

Cole, 

Computing Centre, Sydney Uni 

coming 

Ken Yap, 

Sydney Uni 

coming 

A. L. 

Arms, 

Western Electric Co Inc 

coming 

David 

C Hunt 

Mathematics, UNSW 

coming 

G. W. 

Gerrity 

Military Studies, UNSW 

coming 

T. A. 

Dolotta 

Bell Labs 

possible 

E. Foxley 

Mathematics, Uni Nottingham 

not coming 

R. J. 

Wolff 

Inst Astronomy, Uni Hawaii 

not coming 

P. A. 

Lee 

Computing Lab, Uni Newcastle upon Tyne 

not coming 

Col in 

Taylor 

Westfield College, Uni London 

not coming 

R. P. 

A. Collinson 

. Uni Kent 

not coming 


I eagerly await more replies, 

Robert Elz. 
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The notes you have received are a summary of the Software Tools and UNIX 
meetings in Boulder, Jan 80. Please use them as appropriate in your 
newsletters. Copies have been sent to those persons below, which means 
that individual subscribers will receive multiple copies. So be it. Better 
that than none at all. 

Martin Tuori, Greg Hill, Ian Johnstone, David Tilbrook 


Bruce Anderson 

UK UNIX SIG Newsletter Editor 
Electrical Engineering Science 
University of Essex 
COLCHESTER C04 3SQ 
England 

J2 copies (including one for the Australian newsletter) to? 
Ian Johnstone 

Bell Labs / , ~ 

600 Mountain Ave. /€/&/' * 

Murray Hill 

New Jersey 07974 

USA 

Newsletter Editor 
Usenix Association 
Box 8, 

The Rockefeller University 
.1230 York Ave. 

!*ew York, NY 4002*1 
USA 

Neil Groundwater 

Software Tools Newsletter Editor 
Analytic Disciplines Inc. 

8320 Old Courthouse Road 

Vienna, VA 22480 

USA 

Newsletter Editor 
Login West 
PO Box 581 
Menlo Park 
California 94025 

USA 

f \ 

Mike Tilson 

Canadian UNIX Sig Newsletter Editor 
Human Computing Resources porp, 

40 St, Mary Street 
Toronto, Ontario 
M4Y IP9 




8 


AUUGN 




S^ccci 
53 3'5-fi 

3 a re o 

g§ 2 , 2 ,= 

u q, H H 


sopas: 

>— o w n> B 
a\f OS'S 

u s. 2 o _j 


“S - o3rrw»--=.—' 

“•“•srSsSSo. 0 * 
ob=-~ n> 3 E>3* ) Crt 

g<S3«r' ! u»t«3 
< E, rr. B m" 5 5 tn2 

5. CL r,'ZB. 
«e 2 3 S. 3 — 00 3 *-«- 
3 n 5- = Org|X- 
• S " E tr3 » 
33‘C a j3»oS 

^"“Smo’H.wS 
b 3 K - _ 2 . * 5' <* 

39^3j^p»' 
y|“ s.^s jg < s 
® 3-a ~ o o&S?- 
? n o ° 53 £2. tt 

C fTl M *“t , «« O 

StrS-.S-"^ 3 ?^ 

“■b-5 3 §■■§ * 

2 “" - "-1 ■-»• y—■ ,., *~n ® 

=• 3 B n» g - E o 1 ®? - 

9^3-,5B2S J 

& gQ.o’ 5 - tn 3" n> 

& & 5 -i § 3 £5 g. 

y 2 o.®‘ s So# 

- ~ 2*. b» 3 = „ 

o^S^agS,^ 

S’^iSSf 2 8 2 

■u 2 ” s- — S Q«< v 
25‘8!%^.»2 5 

“S " 2j § s 0^2 

00 o-o S* c ~ « 

§CC3-C «3 

s."« s 3 n Ig 5 ^ 
s 3 , ar.?s.= r S! 
s- 8 si , §^ii JS 

crrtw3qc;w^3* 

|§-3S." , fsr8« 

I a °-o “ 

o | -• ? ^ 2 v® i 

3 § 3 H 2i*3- 

2Cw3sB5;c« 

X Q jj* jj •— p ~“*o j. 

?8SoS&»gJ5 

n 5-’ 5 3 < ^ r 

2«ffCSs-2«S 
“■SffMSS'aStt 
3 * 3 . wW- sr3® 5 
b 0 7 » £5 o 

£.3 £ & % 5 « * 

ff.“.S Xfl? 38 


UUMwu W (j wuwk , u>im 

— 0 'OMniout,aumI;S 100 o v , 0 , w , a ““ - gOOD^CMnAUW.- 


~«-“' 0 »-J 9 vOI#kV)W- 


Jcnpti 

;Z» 9 2 , 

:' x a. 3 § i 

' O "0 c/5 § ' 

fia o 3 o 
J 3 ° 

'>g’23 | 

. 3 2 b *o 
! n. 3 a 3 

co o> y ■ 

3* 35 tw < ■ 

■ =• < O 2 r 

■-B 3 ; 3 3 J 

■ o 2 S S : 

I o' r* ’ 
3r* •“* gj 


> v> Z' "O 2 ; 

: > |s ? 

;w 2 B 2 . 

B H. S _ 

C 2 0 9 

0 0<j 3 O g 
3 

i 3 n£“ 
r 3 r* =• sa 

' Q<g « 

9 O s 3 

wo ‘i!. 
o »n 

o 

? W3 

O 2.0 

-r 

$ 


JJirno 

10 O 3 TJ 

a S sr ft 

n. 2 g 3 

is 8 S' 

S' 3 H » 

Sf 

la * 


7* 

© nx« 

02 ^ 2 2 S’ 5- cb » 1 

l?5^i , £i5|fi5S5 , ip2s*s; 

|«3|g!g ! g|!5g.|o|'||E:g>S.|i 

^85SN5-^iq5'§S?ll8g3oi 


’ 5 2 ~ 5? “2 eT s. 

Hn o 3 §- 2 T. ft t 

- _ W e — Pro < 3 
JT g w 2 £~m o s,. 
2 2.3-S p aj i 
<»r. »5o«J 
in' 2 3 3 & a,!rg 


AUUGN 


9 




' O 1) 

T: — C 


i E vi ~ | 

i. O « '3 ^ 

f u ^ > S! 

) u tJ J , 
; u u / O' 
’ E. o w S j 
* E c. o(j 
5 ,° — 2 ^ 
^ e 1 &- 

? v u o- ^ 

. *5 C t vr 
; o — ‘5 «•> 

:Ec-C"“ 

: < H JC 


'o yl - 

? U " 3 " 
: < C o sj 
' L- ■— V) CJ 

!w 3 3 i 
! c o "3 •£ 

• D C <= __ 

[fig I. 

! o -I c n 

1 > c 
: 2 o £*0 

| *** >< J3 2 

) >• 0 E 00 

, r J _ . 

• I 1 C *— 

• r-_i; QJ r3 


8 £ > o 

u ■£ 

S-* -a - 


X u. ■* 

O w- u 

ON f‘ CQ 

~ ^>8 
n o c 

« '■§ s 

s. « S 

c/i C J 


.r c ^ 

_* —' 0 

< .2 
5b 

zz c , 

u o jy 

<0 < • 


c c 2 ? 

a rj x: D« 


1 & .5 "2 S .5 

! 2 t £ S= 3 

L-, a w. a) c 

^ Ll o j= S 

S E w a. 
a!,o j <u c 

H < 5 « 

•o -O U 
-r: o> 

S Q 


10 


AUUGN 


standard utilities should be available throughout the network: a! the programming level, a single set of 
system calls and primitives within Ratfor should be available throughout the network. This creates a 
virtual operating system which can be overlaid on any host in a network, in which system services 
(utilities and system calls) are consistent. So far this has been implemented under VMS. RSX. UNIX 
and TENEX. 
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0MN1X is u Z80-based system which supports pipes, I/O redirection, a shell, but no C compiler yet. It 
is NOT a UNIX, but is similar. It is multi-user, and supports a UNIX filesystem. It can run CP/M 
binaries, as it supports the CP/M system calls. It handles a range of peripherals, and sells for S350. 
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14) text.c has a parenthesis problem; running lint -h on it will show it up. 
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The user can operate VERT in a variety of modes. One key at a time input mode can be used for echo, 
and to determine variations in keyboard layout (hunting). Word at a time has normal uses, while no 
echo on input is used for entering large amounts of text. Output modes vary as well, depending on the 
context. The unit can be set to buffer a character at a time, word, line, or paragraph. It has been 
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ABSTRACT 

This paper presents models and reasons for 
native UNIX performance. Some figures are given 
for real systems. Some improvements are men¬ 
tioned. The purpose of this paper is to give in¬ 
stallations a tool for evaluating the performance 
of their systems and sufficient information to 
know when their system has reached full capacity. 

The results presented in this paper are based on 
years of analysis of the UNIX systems at the 
University of California at Berkeley. These 
results are generally applicable though the order 
of importance may change. 
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To begin with, I feel that is important to observe that 
there seem to be two different definitions of ,% time shar¬ 
ing''. One definition implies that time sharing is the 
sharing of resources by users which would otherwise not be 
fully utilized by a. single user. The other definition im¬ 
plies only that resources are shared regardless of whether 
the resource is already fully utilized. 

The consequences of adopting either of these defini¬ 
tions will become clear in the pages ahead. 

Performance can be measured in terms of I/O, swap, 
comutation, and overhead. I/O is defined as the amount of 
blocks tranferred to and from the file system i.e. exec 
blocks and blocks read or written by the user. Swap is de¬ 
fined as the amount of movement of processes in and out of 
main memory. Computation is defined as the amount of cen¬ 
tral processing time being used by the user community. 
Overhead is defined as the amount of central processing time 
being used by the operating system. 

In the following sections, the four areas of perfor¬ 
mance will be looked at In-depth. 

5 Mag 

Excessive swapping degrades a system more quickly than 
any other area of performance presented in this paper. This 
Is tne reason for discussing swapping first. 

There are three basic levels of swapping. The first 
and easiest case to look at is no swapping. If processes 
are not being swapped in and out of main memory then 
processes execute at the speed of the memory and are 
governed by the scheduling of the central processing unit. 
Thus, a performance curve (execution time of a single pro¬ 
cess. versus the number of simultaneously executable 
processes) would be a linear increase in time as the number 
of simultaneously executable processes increases. 

The next level of swapping is sometimes referred to as 
easy swaps. (An easy swap Is when a sleeping process can be 
swapped out and an executable process can be brought in.) At 
this level of swapping, the cost of a swap (e.g. the time to 
reload a process into main memory) grows from small to a 
large factor in comparison to the process's life time. As 
the number of swaps per time period increases, the cost to 
each process grows as the size of the process times some 
factor over the execution time allowed between swaps. 
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A few typical numbers for the factor mentionaed above 
are given below: 

RS04 4*0 + 0.66 = 4.66 
BM03 1*6 + 1.16 = 2.76 
RP06 2.5 + 1.16 = 3.66 

All numbers are in microseconds 

r 

.. The first number is the typical tranfer time eof a word 
from these disk systems to main memory. The second number 
is typical arm movement latency and rotational delay calcu¬ 
lated on a per word transferred basis. 

In general, the number of swaps in a time period will 
be a function of the number of simultaneously rexecutable 
processes. 

Thus/ the performance curve changes slowly from the 
slow rising line of simply sharing the central processor to¬ 
wards a rapidly rising line governed by the factors shown 
above. 

At some point as the number of simultaneously' execut¬ 
able processes increases/ there will be no sleeping process 
in main memory to throw out. So executable process will 
have to be thrown out of main memory to allow otruer execut¬ 
able processes a chance to be swapped into maim memory. 
This is sometimes referred to as hard swapping. 

Since processes are guaranteed to be in main memory for 
a certain amount of time, once in, there will be a limit to 
the numoer of swaps per time interval. Thus, performance at 
this level again depends linearly on the number of simul¬ 
taneously executable processes but at a much accelerated 
rate. 

Mathematically/ these levels can be viewed as the fol¬ 
lowing equation: 


Time = (factor * size) * swaps(N) + (execution time)*N 

where N is the number of simultaneously executable processes 

swaps(N) = f(n>MAX ? Cl-MAX/n) : 0) 

+ CN>MAX ? Cl-N/MAX) : 0)) * U 

uh a r a MAY i e ^ h a m ja { m 11 m mimhar n + n Ttftf P < a c in i n mP’TtmV* 

W 4 is.4 ^ l"i jn A U V.il'. iuu<\ ill! uai numw W 4 W ^ fc vw w -*»•** ^ — ... w * X , 

where U is the characteristic usage of the system 
and where n is the number of processes 


The following figure Is a model based on a simpler version 


- 
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of this equation. The two lines in this figure represent 
the assuratotoes to the expected performance curve. 


EXECUTION TIME FOR PROCESSES 



AMPVX 


Figure 1 


A few examples at this point will help to clarify the 
issue of swap performance and show validity to the models 
and reasons given above. Below is a performance curve for a 
UNIX system running on a POP 11/70 with a RS04 fixed head 
disk for swapping and another RS04 for the root file system. 
Both of these drives are on the Massbus. The user file sys¬ 
tems are on several. Cal. Comp Model 615 drives. These drives 
28 AUUGN 








tion tripled the amount of memory available for user 
processes. Below is the performance curve for this system 
after the addition. 



Figure 3 


There are two important differences between these per® 
formance curves that should be noted. The swap degradation 
has vanished. Measurements of this system showed that be¬ 
fore the additon of memory? the swap scheduler was running 
100% of the time at maximum user load. (Note that load was 
a function of what users were willing to put up with as far 
30 AUUGN 
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as response time was concerned.) After the addition of 
memory# measurements showed that the swap scheduler was run¬ 
ning less than 10% of the time. This activity can be ex¬ 
plained by the movement of shared text from the swap disk to 
main memory. 

The other difference to note is that the performance 
curve has moved down along the time axis in figure 3 even 
though the slope remained the same. This change is due to 
the fact that the new memory was almost twice as fast as the 
old memory. 

Even though the POP 11/70 has a cache# the increase in 
the speed of the main memory had a number of effects. Cache 
misses take less time. Direct memory accesses take less 
time to complete. The central processor could get more 
memory cycles between direct memory accesses. 

The left side of the performance curves of figures 2 
and 3 start out together. The reason for this similarity is 
due to file system performance. (File system performance 
will be discussed shortly.) 

From these two examples# we can see that the amount of 
main memory and the speed of the main memory are very impor¬ 
tant to system performance. 

Even though these figures are based on the characteris¬ 
tics of the users of this system, the model still applies to 
any system. Also remember that the performance curve will 
be pushed up the time axis by a factor of two approximately 
for systems without a cache. 

In general# the amount of main memory for best perfor¬ 
mance is- the number of simultaneous users (the number of 
ports is adequate) times the average size of a process plus 
the size of the operating system (a number to start with is 
32k bytes per process). 

In closing# I would like to share two other pieces of 
information. 

If you are not fortunate enough to have enough memory 
for your user load# you may wish to look closely at the al¬ 
gorithm for the swap scheduler. This algorithm has the ugly 
feature of allowing small processes to stay in main memory 
almost indefinitely. These small processes will have the 
tendency to fragment main memory and cause a loss of avail¬ 
able memory for user processes. This ugly feature can be 
partially corrected by adding in the time a process has teen 
In main memory times a factor (8 is fine) to the size of a 
process during the determination of the largest processes 
that can be easily swapped out. 
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Measurements of memory fragmentation show that 10 per¬ 
cent of the memory is lost to fragmentation anyway. 

The question of a fixed head disk versus a moving head 
disk for swapping is commonly asked. Let me answer this 
question with the following measured facts. I have yet to 
see a moving head disk drive transfer continuously at a rate 
above 100 blocks per second when used as a swap device. 

I have seen a fixed head disk Cof a similar transfer 
speed) transfer continuously at 500 blocks per second when 
used as a swap device.. The only conclusion I can draw from 
these measurements is that any seeking destroys system per¬ 
formance. 

In conclusion, I would say from- my measurements that 
swapping should be avoided for good performance. 


LIFE OF A PROCESS 
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Figure 4 
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A proper perspective on computation and file system ac¬ 
tivity is needed before considering performance effects in 
these areas. The following discussion and figure should 
provide an adequate perspective. 

Let us for a moment consider the life of a process. 
Birth usually begins by a fork system. Since forks depend 
upon memory: space or swapping, the speed of a fork is deter¬ 
mined by swapping activity. A fork is usually followed by an 
*exec # system call. An exec loads the image for a. process 
either totally from the file system or partially from the 
swap disk and partially from the file system. Beyond this 
point, the life of a process is determined by the user. The 
user may or may not read and/or write data in the file sys¬ 
tem. The user may or may not do a considerable amount of 
processing. Hopefully at some point, the process will exit 
via one method or another. 


As can be seen from figure 4 above, we must look at 
exec's as well as user file system activity to see the whole 
picture on file system activity. 

Ella. System, - 

The first file system activity from a process will be a 
path search- The basic file system activity during this 
search is 1 seeking between the storage location of the inodes 
and the storage location of the blocks that make up the en¬ 
tries of a directory. This search takes more time as the 
file system grows in depth and bulk. 

Path searching may be done rarely but it is expensive 
in its use of the disk system. Searching is sped up by hav¬ 
ing the directory inodes already in the inode cache. The 
directory may already be in the inode cache If more than one 
process is using it. 

The probability that a directory inode is in the inode 
cache is small due to the rarity of searches except if the 
directory is the current directory. 


To improve on patn searching performance, a number of 
approaches have been tried with varying success. One ap¬ 
proach is to open directories used frequently by a program 
like 'init'. Another approach is to have the 'set user id # 
bit mean that the inode is to be locked in the inode cache 
on first use until the device is unmounted. 


Measurements of directory inode activity show that 50 
percent or more of all activity is through /, /bin, /lib, 
/dev, and /tmp and only 50 percent or less is through the 
rest of the file systems. 
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These solutions are not elegant but do provide some im¬ 
provement in file system performance. The inode caching al¬ 
gorithm itself may need to be improved if a more elegant 
solution is desired. 

Once the exec has the file from which the image is to 
De loaded, there are two possibilities. Either all or part 
of the image is read from the file system. If part of the 
image is loaded from the swap disk, the speed at which it is 
loaded is determined by swap activity. 

How quickly the image loads from the file system 
depends on its linearity and the effectiveness of the read- 
ahead. The linearity of the image is dependent upon the 
linearity of the free list of that file system and the speed 
at which the free blocks are used. 

Only in version 7 has anything been done in this area 
to ensure linearity on a dynamic basis. Occasionally for 
version 6 systems, static but very active file systems 
should be dumped and restored on a freshly made file system 
to improve the linearity of files commonly used. 

The effectiveness of the read-ahead in UNIX systems 
depends on the buffer cache performance. Specifically, the 
number of buffers versus the number of users governs the 
read-ahead effectiveness. In order for read-ahead to be ef¬ 
fective, there must oe two buffers available: one for the 
currently needed block and one for the read-ahead block. In 
fact, if insufficient buffers are available, an attempt to 
do read-ahead may delay the currently needed block from be¬ 
ing read in. This relationship between the number of 
buffers and the number of users doing file system activity 
is illustrated below. 
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Figure 5 


As a rule of thumb, the number of buffers a system 
needs is twice the number of simultaneous users to guarantee 
effective use of the read-ahead and good file system perfor¬ 
mance. 

Note that all of the above also applies to user file 
system activity. 

r liffpr- Cache Bed hiam 

The loading of an image during an exec system call un¬ 
fortunately has a negative effect on user file system per¬ 
formance. 

In original version 6 systems, the Image being loaded 
would cause every buffer to be used because the buffers were 
put on the end of the buffer free list. Thus, any partial 
file system activity by other processes will be flushed from 
the buffer cache. Read ahead blocks for other processes are 
also flushed out as well. 

This problem may have been fixed in version 7 though 
the solution chosen needs further investigation. The main 
Idea of a solution is to put buffers unlikely to be used 
again at the beginning of the buffer free list. Definition 
of unlikely is usually based on the same assumptions that 
read ahead is. Thus, a buffer whose contents has been read 
or written to a block boundary is assumed not to be used in 
the near future. 
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.To get a feeling for file system performance, let me 
throw out a few figures. 

Measurements show that only 1.5 to 1.6 blocks are 
transferred per seek. This measurement coincides with the 
hit rate on the buffer cache of 50 to 66 percent. These 
measurements might mean that the only other block read 
between seeks is its read-ahead block, if that. 

Obviously since blocks per seek is small, the time per 
seek becomes important. A typical time appears to be 20 
milliseconds. These measurements are supported by the fact 
that file system activity only amounts to 10 to 20 blocks 
per second maximum. This rate is in contrast to 100 blocks 
per second for moving head swap devices — a factor of 10. 

There are a few other ideas for improvements in this 
area I would like to comment on below. 

The question has been asked if the root or tmp might be 
better off placed in main memory as a special file. This 
idea is only reasonable if there is more memory available 
above and beyond what is needed for orocesses to run without 
excessive swapping and if there is lots of central process-* 
ing time to waste. There is so little processing time 
available on most systems that this improvement is only 
feasible for special purpose systems. 

Another improvement to accommodate more buffers is to 
overlay suffer space or move the buffers totally out of ker- 
nel main memory. Here again, it is important that there be 
no loss of main memory available for user processes. Over¬ 
laying buffer space is not expensive as far as central pro¬ 
cessing time but tnis scheme is yet unimplemented. Moving 
the buffer space totally out of main memory will cost con¬ 
siderable amount of central processing time or delay in pro¬ 
cessing interrupts. 

In conclusion, I would like to suggest that systems 
should fit on suitable hardware rather than playing games to 
make it fit on a smaller machine. Good performance will 
only come from a system with proper hardware. 


A user may use as little or as much central processing 
time as may be needed. Obviously, central processing time 
can be used 100 percent of the time by a process which just 
loops forever. The main objective in this area of perfor¬ 
mance is to share the central processing unit fairly. The 
question is what is fair and what will fairness cost in tne 
area of operating system overhead. 

Even to consider scheduling the central processing unit 
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fairly# some mechanism is needed to interrupt the process 
which has been using the central processor. Rescheduling is 
currently requested during clock interrupts — once a second 
— and during interrupts for completion of transfers by dev¬ 
ices. Rescheduling is not done unless this interruption oc¬ 
curs while the user process is running. Thus#, the percentage 
of time the system is in user mode is important to how often 
the central processor is rescheduled. 

Mathematically# the probability of rescheduling is the 
product of the probability that the system is in user mode 
and the probability that a request for rescheduling is pend¬ 
ing. 

PCreschedule) = PCuser) * P(reauest) 


For now# let us harshly consider rescheduling prooabil- 
ity as a measure of fairness. Then fairness should increase 
if tne amount of 'user time' available is increased or if 
the number of requests is increased. In the first case# 
user time Is gained by reduction of operating system over¬ 
head.- Thus# any solution which increases overhead Is of 
marginal# If any# benefit. In the second case® there are 
two ways to increase the number of requests for reschedul¬ 
ing. One way to increase requests is to increase the number 
of completion of devices somehow. This approach increases 
operating system overhead so it is rejected as a solution. 
The second way is to increase the number of rescheduling re¬ 
quests generated by the clock interrupt. Since the clock 
Interrupts sixty times a second# rescheduling requests can 
be done more often than once a second. The overhead here is 
nominal since the state of the process has already been 
saved and context switching Is relatively inexpensive. 

In general# measurements show that most systems spend 
about 50 percent of its time running user processes and 50 
percent of its time doing operating system work® This means 
that rescheduling can only happen 50 percent of the time at 
most* Measurements show that rescheduling is requested 
about 10 to 20 percent of the time at most. 

In fact# clock Interrupts can only make rescheduling 
requests 1 percent of the time; disk request completions can 
only make rescheduling requests 33 percent of the time, and 
terminal I/o completion can only make rescheduling requests 
66 percent of the time. 

In conclusion, I see the best way to improve % ’fair¬ 
ness'' in sharing the central processor is by increasing the 
number of requests for rescheduling via the clock interrupt. 

I have experimented in this area. I have tried twice a 
second and ten times a second. Subjectively, there seems to 
be better response time when competing with compute bound 
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jobs. I might suggest that the number of context switches 
per second depend on the percentage of user time available. 

If a reason is necessary for such improvements then 
consider the following. Rescheduling of the central proces¬ 
sor because of device completion is based on the priority of 
the previous operating system interactions. The clock In¬ 
terrupt provides a eonvient place to" sample priorities of 
the users and rotate the queue of equal priority processes. 

G&arhaad 

Operating system overhead can be usually improved by 
optimization of code frequently used. Profiling a system Is 
an excellent way to determine which code is frequently, exe¬ 
cuted. Interrupts and system calls are invariably heavily 
used because they are the start of most transactions that 
take place in the operating system. 

Measurements at University of California at San Fran¬ 
cisco showed that system calls take at minimum of 320 mi¬ 
croseconds. Thus# the more system calls done; the more 
•overhead Incurred. To reduce overhead, system calls should 
be done as Infrequently as possible. Terminal i/o Is usual¬ 
ly a sore point for the number of system calls. 

Interrupts affect system overhead In several ways. In¬ 
terrupts block out other interrupts at this level and lower. 
Interrupts steal central processing cycles. To reduce sys¬ 
tem overhead in this area, the number of interrupts as well 
as the duration of the interrupts must be reduced. The 
number of interrupts depends on the mode of transfer and the 
reason for the interrupts. Terminal i/o Is the most fre¬ 
quent Interrupt In most systems. The number of Interrupts 
can be reduced by FIFO'ing the data In bursts or using 
direct memory access devices. A common reason for high sys¬ 
tem overhead is unterminated lines and noise. Login 
processes* can create a feedback loop with open lines and 
cause continuous interrupts. 

Measurements of terminal i/o shows that 90 percent of 
all transfers are output and only 10 percent are Input. 
Thus optimization in the area of terminal output may reduce 
system overhead. One approach to improving character by 
character output Is called "pseudo dma' # . The idea here is 
to simulate tnc direct memory access device. In other 
words, each terminal has a pointer to some storage space and 
the number of characters at this location. This metnod pro¬ 
vides a quick way to get the character to be outputted and 
return from Interrupt. 

Most Interrupts can be handled simoly and usually do a 
wake up of some process. The wake up itself becomes the 
chief expense of the interrupt, wake ups are expensive be- 
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cause they do a linear search through the entire process 
table looking for appropriate processes to be put on the run 
queue. There are two approaches to reducing tne search 
through tables. These optimization methods can be used 
through the operating system. One approach is to keep track 
of the last entry in the table and only search this far. 
This approach will have a large effect if only a small por¬ 
tion of the table is used. 

The second approach is to drop table searches and use 
queues. Measurements show that most of the time only a sin¬ 
gle process is waiting on an event. These results support 
the idea of using queues. 1 have not heard of or implement¬ 
ed myself this scheme but it might be worth trying. 

Another area of system overhead is copying data from 
user space to kernel space or kernel space to user space. 
These copies are expensive operations. During these copies 
interrupts are not allowed. Interrupts can be allowed in 
this section of code it the supervisor registers are not al¬ 
tered during any interrupt. 

. There are a number of improvements that can be made to 
the operating system to reduce overhead. Eacn improvement, 
of course, will have a varying amount of reduction of over¬ 
head depending on how the system is used. 

In closing this section, let me say that a more sub¬ 
stantial improvement can be noted in all areas with in¬ 
creased memory speeds even though the relative percentage 
may not change. 

fond ns i.Qa 

In closing, let me say that the basic principles of 
good performance have been around for a wnile. The reason 
it may seem new is that we have been exploring in new areas 
and nave let past experience in this matter slip away. The 
only surprizes in this paper may be the models and the fig¬ 
ures. 

I hope that UNIX installations will now upgrade there 
performance to match the brilliance of tne system under¬ 
neath. 


I would like to thank the following people for sharing 
tne results of their measurements or helping me collect in¬ 
formation: Professor Fabry, University of California at 
Berkeley; Bob Kridle, University of California at Berkeley; 
Tom Ferrin, University of California at San Francisco. 
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ABSTRACT 

This paper describes a modified version of the UNIX operating system that 
supports virtual memory through demand paging. The particular implementa¬ 
tion being described here is specific to the vax*-1 1/780 computer system 
although most of the design decisions have wider applicability. 

The modified system creates a large virtual address space for user pro¬ 
grams while supporting the same user level interface as UNIX. The few new 
system calls that have been introduced are primarily aimed for performance 
enhancement. The paging system implements a variant of the global CLOCK 
replacement policy (an approximation of the global least recently used algorithm) 
with a working set like mechanism for the control of multiprogramming level. 

Measurement results indicate that the lack of reference bits in the VAX 
memory management hardware can be overcome at relatively little expense 
through software detection. Also included are measurement results comparing 
the virtual system performance to the swap based system performance under a 
script driven load. 
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1. Introduction 

The most significant architectural enhancement that the vax-1 1/780 provides over its 
predecessor, the PDPMl, is the very large address space made available to user programs. The 
.fundamental task of transporting UNIX to this new hardware was accomplished by Bell Labora¬ 
tories at Holmdel. In addition to the portability directed changes, the memory management 
mechanism of the base system was modified to make partial use of the new hardware. In par¬ 
ticular, through these changes, processes could be scatter loaded into memory thus avoiding 
main memory fragmentation, and swapped in and out of memory partially. A process, however, 
still had to be fully loaded in order to execute. While no longer limited by the 16 bit address 
space of the PDP-11, the per process address space could grow only as large as the physical 
memory available to user processes. This system, which constituted a prerelease of UNlx/32Vf, 
was adopted as the basis for virtual memory extensions. 

The virtual memory effort was motivated by several factors in our research environment: 

* To provide a very large virtual address space for user processes, in particular, Lisp sys¬ 
tems such as macsyma, and other systems employed in Image Processing and VLSI 
design research. 

* To provide an easily accessible virtual memory environment suitable for research in the 
fields of storage hierarchy performance evaluation and automatic program restructuring. 

* To try to improve overall system performance by making better use of our very limited 
memory resource. 

The reader should be familiar with the standard UNIX system as described in [RITC 74] 
and the virtual memory concept in general [DENN 70]. In the next section, we briefly describe 
the memory management hardware as it exists in the vax-11/780 to support virtual memory 
[DEC 78]. The following sections detail the new kernel operations including new system calls 
followed by various measurement results. 


t UNIX and UNIX/32V are Trademarks of Bell Laboratories 

♦ Work supported by the National Science Foundation under grants MCS 7807291, MCS 7824618, MCS 
7 i 4Q7644“AQ3 srid b u an IBM Or sdusts Fellowship to the second suthor. 

• VAX and PDP are trademarks of Digital Equipment Corporation. 
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2. VAX-11/780 Memory Management Hardware 

The VAX-ll/780 memory management hardware supports a two level mapping mechanism 
to perform the address translation task. The first le d page tables reside in system virtual 
address space and map user page tables. These tables in turn, map the user virtual address 
space which consists of 512 byte pages. The 32 bit virtual address space of the vax-11/780 is 
divided into four equal sized blocks. 



Fig. 2.1. Virtual address space 

Two of these blocks, known as the PO and PI regions, constitute the two per process segments. 
The third block, known as the system region, contains the shared kernel virtual address space 
while the fourth region is not supported by the current hardware. The PO segment starts at vir¬ 
tual address 0 and can grow toward higher addresses. The PI segment on the other hand, starts 
at the top of user virtual address space and grows toward lower addresses. Both segments are 
described by two per process (base, length) register pairs. 

A page table entry consists of four bytes -of mapping and protection information. 
Attempting a translation through a page table entry that has the valid bit off results in a Transla¬ 
tion Not Valid Fault (i.e., a page fault). Whereas most architectures that support virtual 
memory provide a per page Reference Bit that is automatically set by the hardware when the 
corresponding page is referenced to be examined and/or reset by the page replacement algo¬ 
rithm, the VAX-ll/780 has no such mechanism. In order to overcome this deficiency, we distin¬ 
guish the three states that a page of virtual address space can be in: 

[1] Valid — the page is in memory and valid. This corresponds to the valid, referenced state in 
the presence of a reference bit. 

[2] Not valid but in memory — the page is in memory but the page table entry is marked not 
valid so as to cause a page fault upon reference. This is the so called reclaimable state of 
the page. Equivalent to the valid, not referenced state. 

[3] Not valid and not in memory — the page is in secondary storage. Equivalent to the not 
valid state. 

o 

This scheme in effect allows us to detect and record references to pages using software. 
We discuss the cost and effectiveness of the method in §7.2. 
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3. Process Structure 

In UNIX, the notion of a process and a computer execution environment are intimately 
related [THOM 78]. In fact, a process is the execution of this environment which consists of 
the process virtual address space state, general register contents, open files, current directory, 
etc. The state of this pseudo computer is comprised of the contents of four segments. The 
first three contain the process virtual address space, while the fourth segment describes the sys¬ 
tem maintained state information. 

The process virtual address space consists of a read only program text segment that is 
shared amongst all processes that are currently executing the same program, as well as private 
writable data and stack segments. Within the limited segmentation capability of the vax-11/780, 
these three segments are mapped such that the program text is in the PO region beginning at 
virtual address 0 with the data immediately after it starting at the next page boundary. The 
stack segment is mapped into the PI region starting at the highest virtual address. While the 
text segment has a static size, the data segment can be grown or shrunk through system calls 
and the stack segment is grown automatically by the kernel upon the detection of segmentation 
faults. 

The physica} structure of these segments in secondary storage (called an image) can be 
organized in various ways. At one extreme is the physically contiguous organization described 
simply by a (base, length) pair. While appropriate for static segments, such as text, this organi¬ 
zation is too rigid for dynamically growing segments, like the data and stack segments. In addi¬ 
tion to fragmentation, segment growth beyond the current allocation could imply physical 
movement of the image. At the other extreme is a fully scattered organization of the image. 
f -While minimizing fragmentation, this can result in expensive allocation and mapping functions 
due to the large number of pages which are present in large images. 

The image organization chosen for the dynamic segments represents a compromise 
between the two extremes. Each image consists of several scattered chunks. An exponentially 
increasing chunk allocation scheme allows the mapping of very large segments through a small 
- table. Limiting the maximum size of any chunk helps to prevent extreme fragmentation. Thus 
in the current system, the smallest chunk allocated to a segment is 8K bytes, and chunk sizes 
increase by powers of two up to a maximum size of 2M bytes. 





4. Kernel Functions 

We now describe the major new functions performed by the kernel as well as the existing 
functions whose implementation have been significantly modified. For the purposes of future 
discussion, we define the following terms: 

Free List The doubly linked circular queue of page frames available for allocation. Allo¬ 
cation is always from the head, while insertion occurs both at the head (for 
pages which can no longer be needed) or the tail (for pages which might still 
be reclaimed). 

Loop Envision the set of physical page frames that are not in the free list as if they 

were arranged statically around the circumference of a circle. We refer to 
these set of page frames, ordered by physical address, as the loop. Page frames 
allocated to kernel code and data appear in neither the loop nor the free list. 

Hand A pointer to a page frame that is in the loop. The hand is incremented circu¬ 

larly around the loop by the pageout daemon as described below. 

4.1. Page Fault Handling 

The most visible of the kernel changes is the existence of a Translation Not Valid fault 
handler. Given the virtual address that caused the fault, the system checks to see if the page 
containing the virtual address is in the reclaimable state. This happens when the pageout 
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daemon has swept past a page and made it reclaimable to simulate a reference bit (as described 
below). If the page is in this state, it can once again be made valid, and the process returns to 
user mode. Note that if the reclaimed page was in the free list, it is removed and reenters the 
loop. Since none of the operations involved in reclaiming a page can cause the process to block , 
reclaiming a page does not involve a processor context switch and reschedule. 

If the page cannot be reclaimed (i.e., is not no longer in core), then a page frame is allo¬ 
cated and the disk transfer is initiated from the segment image as dictated by the image map¬ 
ping. 

In reality, more cases must be considered. If the faulting page belongs to a shared text 
segment, the disk transfer is initiated only if the page is not reclaimable and not intransit , i.e., 
the pagein operation has not already been initiated by another process that is sharing the text 
segment. If intransit, the faulting process sleeps to be awoken by the process that started the 
page transfer when it completes. Here we note that the first level page tables for shared text 
segments are not shared, but rather, each process has its own copy.t Thus, all operations that 
modify page table entries of shared text segments must insure that all existing copies are 
updated. 

Other types of page faults that require special handling are those where the faulting page 
is marked as fill on demand. There are three types of demand fill pages: 

Zero Fill These pages are created due to segment growth and result in a page of zeroes 

when referenced. 

, Text Fill These pages result from execution of demand-loaded programs, and cause the 

corresponding page to be loaded at the point of first reference, from the 
currently executing object file. Such object files are created by a special direc¬ 
tive to the loader and are described further in §5.3. 

File Fill These pages are similar to text fill pages, but the pages come from a open file 

rather than the current text image file. These pages are set up by the vread 
system call. See section §5.2 for more details. 


4.2. Page Write Back 

During system initialization, just before the init process is created, the bootstrapping code 
creates process 2 which is known as the pageout daemon. It is this process that actually imple¬ 
ments the page replacement policy as well as writing back modified pages. The process leaves 
its normal dormant state upon being woken up due to the memory free list size dropping below 
an upper threshold. 

At this point, the daemon examines the page frame being pointed to by the hand. If the 
page frame corresponds to a valid page, it is made reclaimable. Otherwise the page was 
reclaimable, and it is freed, but remains reclaimable until it is removed from the free list and 
allocated to another purpose. The hand is then incremented and the above steps are repeated 
until either the free memory is above the upper threshold or the angular velocity of the hand 
exceeds a bound. 

The rate at which the daemon examines page frames increases linearly between the free 
memory upper threshold and lower threshold (these are tunable system parameters). In a 
loaded system the hand will be moved around the loop two to three times a minute. 

Upon encountering a reclaimable page that has been modified since it was last paged in, 
the daemon must arrange for it to be written back before the page frame containing it can be 
freed. To accomplish this, the daemon queues a descriptor of the I/O operation for the paging 
device driver. Upon completion of the I/O, the into.rupt service routine inserts the descriptor 


t Sharing all user level page tables of shared segments would require a 64K byte alignment between the text 

~~a TKir Jo not onTnoroH h \t ttip mrrptit Inariino scheme sn currently these naee tables are 
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not shared at all. 



AUUGN - 45 






- 5 - 


to the cleaned list for further processing by the daemon. The daemon periodically empties the 
cleaned list by freeing the page frames on it that have not been reclaimed in the meantime. 

Note that as described above, this pageout process implements a variant of the global 
CLOCK page replacement algorithm that is known as scheduled sweep [CORB 68, EAST 79]. 

4.3. Process Creation 

In UNIX, every process comes into being through a fork system call whereby a copy of the 
parent process is created and identified as the child. This involves the duplication of the 
parent’s private segments (data and stack) and the system maintained state information (open 
files, current directory, etc.). 

Within a virtual memory environment including the pagein and pageout primitives 
described above, the implementation of the fork system call is conceptually very simple. The 
parent process copies its virtual address space to the child’s one page at a time. Note that this 
may require faulting in the invalid portions of the parent’s address space. Since the VAX-11/780 
memory management mechanism can establish only one mapping at a time, the child’s address 
space is actually created indirectly through the kernel virtual memory. 

Although conceptually simple, the above scheme has undesirable system performance 
consequences. Duplication of the parent’s private segments generates a sharp and atypical con¬ 
sumption of memory. Since a significant percentage of all forks serve only to create system 
contexts to be passed to another process via the exec system call, the copying of the parent’s 
f .private segments is largely unnecessary. The vfork,system call, described in §5.1, has been 
. introduced to provide an efficient way to create new system contexts within the current design. 

4.4. Program Execution 

The exec system call, whereby a process overlays its address space also has a simple 
implementation. The process releases its current virtual memory resources and allocates new 
ones as determined by the program being executed. Then, the program object file is simply 
read into the process address space which has been initialized as zero fill on demand pages so as 
not to generate irrelevant paging from the process’ old image. 

This implementation suffers from the same problems as the above fork implementation. 
Initiation of very large programs is very slow, and results in system wide performance degrada¬ 
tion due to the loading of the entire program file in the virtual memory before execution com¬ 
mences. A new loader format which allows demand paging from the program object file has 
been designed to improve large program start up and to eliminate this non-demand situation 
(see §5.3). 

4.5. Process Swapping 

Swapping a process out involves releasing the physical memory currently allocated to it 
(called the resident set) and writing back its modified pages to its image along with the system 
maintained state information and page tables. Swapping a process in, on the other hand, 
involves reading in its page tables and state information and resuming it. Note that as no pages 
from the process address space are brought in, the process will have to fault them back in as 
required. The alternative of swapping the resident set in and out is not implemented. 

4.6. Swap Scheduling 

When the amount of available free memory in the system cannot be maintained at a 
minimal number of free pages by the pageout daemon, then the system invokes the swap 
scheduler. In order to free memory, the swap scheduler will select a process which is resident 
and swap it (completely) out. The scheduler prefers first to swap out processes which have 
been blocked for a significant iengtn of time, and chooses the process which has been in such a 
state the longest. If there are no such processes, and n is therefore necessary to swap out a 
process which is or has recently been active, the system chooses from among the remaining 
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processes the one which has been memory resident the longest. 

In choosing an active process to swap out, the system checks t© guarantee that the process 
has had a minimal amount of time necessary to demand fault in the number of pages which it 
had wh' i it was last swapped out (based on maximum expected paging device throughput). 
This ser es to guarantee a minimal amount of progress by a process each time it is swapped in. 
When a process is forced out while it has many pages, it is given lower priority to return to the 
set of resident processes than ones which swapped with fewer pages ©r which are very small. 

The swap-in mechanism also recognizes that processes whidh swapped out with many 
pages, will need to fault in pages when they are brought in. The system therefore maintains a 
notion of a global memory deficit, which is the expected short term demand for memory from 
processes recently brought in, based on the number of pages they were using when they 
swapped out. The deficit is charged against the free memory available when deciding whether 
to bring a process in. 

In general, this swap scheduling mechanism does not perform well under very heavy load. 
The system performs much better when memory partitioning can fee done by the page replace¬ 
ment algorithm rather than the swap algorithm. If heavy swapping, iis to occur on moving head 
devices, then better algorithms could be implemented. High speed specialized paging devices, 
on the other hand, would suggest different algorithms based on migration. 


4.7. Raw I/O 

In a virtual memory environment, handling input/output operations directly to/from pro¬ 
cess address space without going through the system buffer cache requires special attention. 

/ The pages involved in the I/O must be insured to be valid and locked for the duration of the 
operation. This is accomplished through the virtual segment locMunlock internal primitives. 
Locking a virtual segment consists of locking pages that are already valid and 
faulting/reclaiming invalid pages by simply touching them and refraining from unlocking the 
page frame (which is allocated in the locked state) after the pagein oompletion. 


5. New System Facilities 


5.1. Process Creation 

In order to allow efficient creation of new processes, a new system primitive, vfork has 
been implemented. After a vfork, the parent and its virtual memory run in the child’s system 
context, which may be manipulated as desired for the new image to he created. When a exec is 
executed in the child’s context, the virtual memory and parent thread of control are returned to 
the parent’s context and a new thread of control and virtual memory are created for the child. 
This mechanism allows a new context to be created without any copying. 

It should be noted that an implementation of fork using a Copy On Write mechanism 
would be completely transparent and nearly as efficient as vfork. Such a mechanism would rely 
on more general sharing primitives and data structures than are present in the current version 
of the system, so it has not been implemented. 


5.2. Virtual Reading/Writing of Files 

In order that efficient random access be permitted in a portaMe way to large data files, a 
pair of new system calls has been added: vread and vwrite. These calls resemble the normal 
UNIX read and '•rite calls, but are potentially much more efficient for sparse and random access 
to large data files. Vread does not cause all the data which is virtually read to be immediately 
transferred to the users address space. Rather, the data can be fetched as required by refer- 

nlr tVto, nrotamp A i o nrcxf i r\ rn At t Vi A r\/~\i n t r\ F tVlP VTPoH thp mPFClV COITIDUtCS tllS 
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disk block numbers of the corresponding pages and stores these in the page tables. Faulting in 
a page from the file system is thus no more expensive than faulting in a page from the swap 
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device. In both cases al! the mapping information is immediately available or can be easily 
computed from incore information. Vwrite works with vread to allow efficient updating of 
large data which is only partially accessed, by rewriting to the file only those pages which have 
been modified.. 

Downward compatibility wit non-virtual systems is achieved by the fact that read and 
write calls have the same semant. s as vread and vwrite calls; only the efficiency is different. 
Upward extensibility into a more general sharing scheme is also easy to provide, as vread can 
be easily simulated by a mapping of the file into the address space with a copy-on-write 
mechanism on the pages. Although the current mechanism does not share copies of the same 
page if it is vread twice, the semantics of the system call do nbt prohibit such an implementa¬ 
tion if used with a copy-on-write mechanism. Note that vwrite can also be simulated by a 
map-out like mechanism. 

5.3. New Loader Format 

The same mechanism that is used to implement the vread system call is used to provide a 
load format where the pages of the executable image are not pre-loaded, but rather initialized 
on demand, with the page block numbers only being bound into the page tables at the point of 
exec. The only change from the other UNIX load formats in this new format is the alignment of 
data in the load file on a page boundary. Large processes using this format can begin execution 
very quickly, with page faults causing reading from the executed file. 


6. Perspective 

There are a number of facilities which have not been implemented in the first release of 
the system as described here. 

For example, there are plans to change the system to use 1.024 byte disk blocks rather 
than 512 byte blocks. It has been observed that in many cases the system is limited by the 
number of disk transactions that can be made per second. Larger disk blocks will help improve 
disk throughput. On machines with large real memories, using page-pairs in the paging system 
will also reduce the overhead of the replacement algorithm and increase throughput to the pag¬ 
ing device. Since a page contains only 128 words, it does not provide a great deal of locality. It 
is expected that using 1024 byte pages (in effect) will not degrade the effective memory size 
significantly. 

Another problem associated with the small page size of the VAX architecture is the rate of 
growth of user page tables.t For very large processes, this not only results in a significant 
amount of real memory allocated to page tables, but also increases the system overhead in deal¬ 
ing with them. Effectively supporting extremely large virtual address spaces will require han¬ 
dling translation faults at the first level (i.e., page table faults) whereby real memory for page 
tables is allocated on demand. 

The system changes as presented here are the result of approximately one man year of 
effort. The base system (a prerelease of UNIX/32V that was maintained as the production system 
during the paging development) and the paged system consist of approximately 14800 and 
16800 total source lines, respectively. The brake down of these numbers amongst the various 
types of source is presented in Table 6.1. Also presented is a comparison that excludes com¬ 
ment lines from the source of the two systems.# We note that the actual number of lines 
modified to obtain the paged system is considerably more than the simple net difference for each 
category. In the meantime, for equal configurations, the -resident kernel size has increased by 
about 12K bytes of program and 26K bytes of data resulting in a total size of about 164K bytes 
(for a 1 megabyte main memory system). 

t Since a page table entry is 4 bytes, user page tables grow one byte for each 128 bytes of user virtual 
memory. 

4 For the C source code the number of occurrences of was used as an estimate of the actual number of 
source lines that were not comments. 
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Frequency 



Category 

Total Source Lines | 

Non Comment Lines 

Base 

System 

Paged 

System 

Bast 

System 

Paged 

System 

Assembly Code 

1292 

1353 

1062 

1015 

C Code 

11581 

13405 

4891 

5614 

Header Files 

1997 

2068 

. 1223 

1316 


Table 6.1. Source Code Volume Comparison 
7. Measurement Results 

The system has been instrumented to collect data related to various paging system activi¬ 
ties as well as workload characteristics in general. 

7.1. Process Virtual Size Distribution 

Being one of the few quantifiable characteristics of a workload that is also of importance 
in a virtual memory environment, system-wide distribution of process virtual size was moni¬ 
tored. 




15OK 300K 450K 600K 750K 900K 1.05M 1.2M : 20K 40K 60K 80K 100K 120K 140K 160K 180K 200K 

Data Segment Size (bytes) I Stack Segment Size (bytes) 

(a) ' (b) 

Fig. 7.1.1. Process size distribution: (a) data, (b) stack segments 


The results of integrating process data and stack segment size over user CPU time are shown in 
Fig. 7.1.1. The two sets of measurements taken almost one month apart indicate an increase 
from 29.6K to 161.7K bytes and 6.8K to 9.8K bytes for the mean data and stack segment sizes, 
respectively. The October 18 measurement corresponds to early stages of production run of the 
system and is representative of the pre-virtual memory workload. The significant increase in the 
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average data segment size within this short period is attributed to the rapid growth of Lisp sys¬ 
tems including macsyma. The insignificant contribution of the stack segment to total p r ocess 
size is a characteristic of our C intensive workload. 


7.2. Page Fault Service Time Distribution 

As described in §2, a page can be in three states. Reference to a page in memory but 
invalid causes a reclaim , whereas reference to one not in memory results in a page-in operation. 
The service time distributions for these two different types of faults is shown in Fig. 7.2.1. 




Time (psec) 

(a) 

Fig. 7.2.1. Fault service time distribution: (a) reclaim, (b) page-in 


For the reclaim time distribution, the first peak corresponds to reclaims from the loop , while the 
second bump and the long tail are accounted for by the load dependent component of the ser¬ 
vice time due to reclaiming pages of shared text segments.f Note that the mean reclaim time of 
208 microseconds per reclaim represents a negligible delay to user programs. Furthermore, the 
overall system cost of reclaims through which we simulate the missing reference bits of the 
architecture has been measured to be less than 0.05% of all CPU cycles.! 

The page-in service time distribution is highly load dependent since it includes all of the 
queueing as well as process rescheduling delays. The configuration with the paging activity on 
the same arm (an RM-03 equivalent disk) as the temporary and the root file systems results in 
a 54.9 msec total service time. The significant number services completed under 20 msec are 
due to the track buffering capability of the controller being used. 


t This operation requires updating the page tables of all processes currently executing the same code, thus 
varies with load. 

$ This cost is actually a function of the paging activity. The number reported here has been averaged over a 
28 hour period in a 1.25M byte real memory configuration 
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7.3. Comparison with Swap Based System 

In an effort to compare the performance of the system before and after the addition of 
virtual memory, a script driven workload was run in a stand-alone manner in both systems 
under identical configurations consisting of a 1 megabyte main memory, an RP-06 servicing the 
user file system and an RM-03 shared by the root and temporary file systems in addition to the 
swapping/paging activity. The swap based system used for this comparison was quite sophisti¬ 
cated, performing scatter loading of processes into memory an$ partially swapping processes to 
obtain free memory. 

The basic unit of work generated by the script is made up of four concurrent terminal ses¬ 
sions: • 

lisp A recompilation, using a Lisp compiler, of a portion of the compiler, and a “dum- 
plisp” using the lisp interpreter to create a new binary version of the compiler. 
Under the paging system, a system load format which caused the interpreter and 
compiler to be demand loaded (rather than preloaded) was used (cf. §5.3). 

ceomp An editor session followed by a recompilation and loading of several C programs 
which Support the line printer. 

typeset An editor session followed by typesetting of a paper involving mathematical process¬ 
ing, producing output for a Versatec raster plotter. 

trivial Repeated execution of a trivial command (printing the date) every few seconds. 


f ' Staggered multiple initiations of from one to four of these terminal sessions were used to 
. ( create increasing levels of load on the system. Figure 7.3.1 gives the average completion time 
for each category of session under the tv/o systems. 



Fig. 7.3.1. Average completion times 
(a) lisp, (b) typeset, (c) ccomp, (d) trivial 


For the non-trivial sessions, completion times were very similar under the two systems, with 
the paging version of the system running (in all but one case) faster, anddhe swap based sys¬ 
tem departing from linear degradation more rapidly. This trend is most noticeable in the 
response time for the trivial sessions. Systemwide measures collected .during the experiments 
are given in Figure 7.3.2. 
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' Fig. 7.3.2. Systemwide measurements 

(a) total (b) average completion time, (c) system time, (d) total page traffic 


These measurements show the same trend for both total and average completion times as for 
individual sessions, with the paging system slightly faster ' and degrading more linearly 
within the measured range. System overhead was uniformly greater under the paging system, 
constituting 26 percent of the CPU utilization as compared to 20 percent under the swap sys¬ 
tem. User CPU utilization was, however, uniformly greater under the paging system, averaging 
48 percent, while the swap based system averaged only 42 percent. 

Finally, the total page traffic generated by the two systems was measured. This accounts 
for both paging and swapping traffic under the paging system, as well as transfer of all system 
information (control blocks and page tables) under both systems. Although the paging system 
resulted in far fewer total pages transferred, the actual number of transactions required^ to 
accomplish this was much greater since most data transfers, under the paging system, are ue 
to paging rather than swapping activity, and thus take place in very small (512 byte) quantities. 
We are currently installing modifications in the system to use larger block sizes in both the h e 
and paging subsystems, and expect improved performance from these changes. 
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Getting started with... 

Berkeley Software for UNIXf on the VAX* 

(The third Berkeley Software Distribution 


A package of software for UNIX developed at the Computer Science Division of the 
University of California at Berkeley is installed on our system. This package includes a new 
version of the operating system kernel which supports a virtual memory, demand-paged 
environment. While the kernel change should be transparent to most programs, there are some 
things you should know if you plan to run large programs to optimize their performance in a 
virtual memory environment. There are also a number of other new programs which are now 
installed on our machine; the more important of these are described below. 

t 

Documentation 

The new software is described in two new volumes of documentation. The first is a new 
version of volume 1 of the UNIX programmers manual which has integrated manual pages for 
the distributed software and incorporates changes to the system made while introducing the vir¬ 
tual memory facility. The second volume of documentation is numbered volume 2c, of which 
this paper is a section. This volume contains papers about programs which are in the distribu¬ 
tion package. 

Where are the programs? 

Most new programs from Berkeley reside in the directory /usr/ucb. A major exception is 
the C shell, csh, which lives in /bin. We will later describe how you can arrange for the pro¬ 
grams in the distribution to be part of your normal working environment. 

Making use of the Virtual Memory 

With a virtual memory system, it is no longer necessary for ranning programs to be fully 
resident in-memory. Programs need have only a subset of their address space resident in 
memory, and pages which are not resident in memory will be faulted into memory if they are 
needed. This allows programs larger than memory to run, but also places a penalty on pro¬ 
grams which do not exhibit locality. It is important to structure large programs so that at any 
one time they are referring to as small a number of pages as possible. 

If you are going to create very large programs, then you should know about a new 
demand load format. This format causes large programs to begin execution more rapidly, 
without loading in all the pages of the program before execution begins. It is most suitable for 
programs which are large (say > 50K bytes of program and initialized data), and especially 
when a program has a large number of facilities, not all of which are used in any one run. To 
create an executable file with this format, you use the — z loader directive; thus you can say “cc 
~z ...” or “f77 —z The file command will show such files to be “demand paged pure 
executable” files. See the manual page for Id in section 1 and a.oui in section 5 of volume 1 of 
the manual for more information. 

If you have or are writing a large program which creates new processes as children, then 
you should know about vfork system call. The fork system call creates a new process by copy¬ 
ing the data space of the parent process to create a child process. In a virtual environment this 
is very expensive. Vfork allows creation of a new process without copying the parent’s address 

tuNix is a trademark of Bell Laboratories. 

tVAX is a trademark of Digital Equipment Corporation. 
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space by letting the parent execute in the child’s system context. The parent can set uo the 
input/output for the child and then return to its own context after a call to exec or _ex . If 
you use the standard I/O routine system0 to execute commands from within your programs, 
then vfork will be used automatically. If you have been calling fork yourself, you should read 
the manual page for vfork and use it when you can. 

In order that efficient random access be permitted in a portable way to large data files, a 
pair of new system calls has been added: vread and vwrite. These calls resemble the normal 
UNIX read and write calls, but are potentially much more efficient for sparse and random access 
to large data files. Yread does not cause all the data which is virtually read to be immediately 
transferred to your address space. Rather, the data can be fetched as required by references, at 
the systems discretion. At the point of the vread, the system merely computes the disk block 
numbers of the corresponding pages and stores these in the page tables. Faulting in a page 
from the file system is thus no more expensive than faulting in a page from the paging device. 
In both cases all the-mapping information is immediately available or can be easily computed 
from incore information. Vwrite works with vread to allow efficient updating of large data 
which is only paftially accessed, by rewriting to the file only those pages which have been 
modified. 

Downward compatibility to non-virtual systems is achieved by the fact that read and write 
calls have the same semantics as vread and vwrite calls; only the efficiency is different. If you 
have programs which access large files, and do so sparsely, read the manual pages for vread and 
■* .vwrite in section 2 of volume 1 of the manual. 

New Languages for the VAX 

There are now available interpreters for apl and Pascal for the VAX, and a LISP system 
supporting a dialect of LISP compatible with a large subset of MACLISP. The apl interpreter is 
the 11 version, moved to the VAX, and now has a large workspace capability (but has not been 
extensively used.) The Pascal system has been used extensively for instruction and research 
and is the same system which was available on the PDP-11. The only limitations of the Pascal 
system are a maximum of 32K bytes per stack frame (due to the implementation of the inter¬ 
preter), and 64K bytes per variable allocated with new. Essentially arbitrary sized programs can 
be run with the system, which supports a very standard Pascal with no language extensions. 
The Pascal system features very good error diagnostics, and includes a source level execution 
profiling facility.! 

The LISP system, “Franz Lisp”, was developed at Berkeley as part of a project to move 
the MIT macsyma system from the pdp-io to the vax. A compiler liszt for Franz Lisp, written 
at Bell Laboratories, is also included with the system. 

For more information about apl refer to its manual page in volume 1 of the manual. The 
Pascal system consists of the programs pi, px, pix, pxp, pxrcf, and pic, all of which are docu¬ 
mented in section 1 of volume 1 of the manual. There is also a paper introducing the system 
in volume 2c. The LISP system is described in The Franz Lisp Manual in volume 2c of the 
manual. 

A display editor — vi ’ 

The system includes the latest version of the display editor vi which runs on a large 
number of intelligent and unintelligent display terminals. This editor runs using a terminal 
description data base and a library of routines for writing terminal independent programs which 
is also supplied. The editor has a mnemonic command set which is easy to learn and 
remember, and deals with the hierarchical structure of documents in a natural way. Editor 
users are protected against loss of work if the system crashes, and against casual mistakes by a 
general undo facility as well as visual feedback. The editor is quite usable even on low speed 

t A compiler for Pascal based on this system is currently being developed, but is not part of this distribution. 
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lines and dumb terminals. 

For users who prefer line oriented editing, the ex command enters the same editor, but in 
a line oriented editing mode. For beginners who have never used a line editor before, there is 
a version of the editor known as edit which has a well-written tutorial introducing it. 

For more information about edit see Edit: a Tutorial in volume 2c of the manual. The line 
editor features are described in the Ex Reference Manual which is in volume 2c of the manual. 
Also in volume 2c zre An Introduction to Display Editing with Vi and a vi reference card. 

Command and mail processing programs 

There is also a new command processor csh which caters to interactive users by providing 
a history list of recent commands, which can be easily repeated. The shell also has a powerful 
macro-like aliasing facility which can be used to tailor a friendly command environment. Csh is 
implemented so that-both it and the standard shell /bin/sh can be run on the same system. 

The Introduction to the C shell introduces the shell. If you have used the standard shell, 
then you should especially read about the history and alias mechanisms of the shell. 

In order thal the manual distributed with the tape correspond to the commands which are 
available to you, you should set the execution search path in your shell startup file to include 
/usr/ucb. If you use /bin/sh you should place the lines: 

PATH=:/ usr/ucb:/bin:/usr/bin 
export PATH 

f. . 

• , in the file .profile in your home directory. If you use csh, then the commands 

setenv PATH /usr/ucb:/bin:/usr/bin: 
set path—(/usr/ucb /bin /usr/bin .) 

should be placed in your file .cshrc, also in your home directory. The C shell hashes the loca¬ 
tions of commands so as to speed command execution. Placing the current directory at the 
end of your search path speeds the location and execution of commands. 

For sending and receiving mail, a new interactive mail processing command provides a 
hospitable environment, supporting items such as subject and carbon copy fields, and allowing 
creation of distribution lists. This command also has a mail reading mode which makes it con¬ 
venient to deal with large volumes of mail. See the manual page for mail in section 1, volume 
1 of the manual, and the Mail reference Manual in volume 2c of the manual for more details. 

Better debugger support 

A version of the symbolic debugger sdb is provided which now can debug FORTRAN 77 
programs. The assembler has been rewritten and the C compiler modified to reduce greatly the 
overhead of using the symbolic debugger, making it much more feasible for heavy use. If you 
are interested, then you should read the new document for sdb, provided in volume 2c. 

Other software 

The distribution includes a copy of the ciruit analysis program spice, which has been tran¬ 
sported to the VAX. You can read about it in its reference manual in volume 2c of the manual. 
Other new programs include programs to simulate the phototypesetter on 200 bpi plotters, a 
common system messages facility, routines for data compression, a modified version of the 
standard I/O library permitting simultaneous reads and writes, a network for connecting hetero¬ 
geneous UNIX systems at low cost (1 tty port per , 'onnection per machine and no system 
changes), and a new, flexible macro package for n/troff —me. New command whatis and apro¬ 
pos can be used to identify programs and to locate commands based on keywords. Try 

cd /usr/ucb , 

whatis * 
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and to find out about Pascal: 
apropos pascal 

Monitoring the new system 

If you want to see what is happening in the new system, you can use the new vmstat 
command, described in section 1 of the manual, which shows the current virtual load on the 
system. The system recomputes the information printed by vmstat every five seconds, so a 
“■vmstat 5” is a good command to try. 

To see what processes are active virtual processes, you can do 

ps av 

The command 
ps v 

will print only the active processes which you are running. 
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Berkeley Software for UNXXt on the VAX* 

(The Third Berkeley Software Distribution) 


A new package of software for UNIX will be available from the Computer Science Division 
of the University of California at Berkeley in early December, 1979. This is a package of 
software for UNix/ 32 Vf licensees, and includes a pag,ed version of the kernel for the VAX, as 
well as a large number of other programs. The tape includes: 

New Languages for the VAX 

Interpreters for^APL, LISP and Pascal. The apl interpreter is the PDP-11 version, moved to 
the VAX. The LISP system, known as “Franz Lisp”, is written in C and LISP, includes both an 
interpreter and a compiler, and is compatible with a large subset of Maclisp. The Pascal system 
is the instructional system which has been distributed previously for PDP-ll’s*. The language 
implemented is very close to standard Pascal, and features excellent diagnostics, and a source 
level execution profiling facility. 

New System Facilities 

t. , The system is now fully and transparently demand paged. As distributed it will support 
- individual process sizes up to 8M of data and 4M of program. These numbers can be increased 
to 16M bytes of data and stack and 16M bytes of program easily given the availability of a rea¬ 
sonable amount of disk space to which to page. Description is given of steps necessary to 
further increase these limits. 

A new load-on-demand format allows large processes to start quickly. A vfork system call 
allows a large process to execute other processes without copying its data space. Virtual ver¬ 
sions of the read and write system calls known as vread and vwrite permit fast random access to 
large files, fetching data pages as needed, and rewriting only changed pages. The system sup¬ 
ports UNIBUS disk drives, and can access and update files on the console’s floppy disk drive, 

A display editor 

The tape includes the display editor, vi, (vee-eye) which runs on a large number of intelli¬ 
gent and unintelligent display terminals. This editor uses a terminal description data base and a 
library of routines for writing terminal independent programs which is also supplied. The editor 
has a mnemonic command set which is easy to learn and remember, and deals with the 
hierarchical structure of documents in a natural way. Editor users are protected against loss of 
work if the system crashes, and against casual mistakes by a general undo facility as well as 
visual feedback. The editor is quite usable even on low speed lines and dumb terminals. 


Command and mail processing programs 

The tape also includes a new command processor csh which caters to interactive users by 
providing a history mechanism so that recently given commands can be easily repeated. The 
shell also has a powerful macro-like aliasing facility which can be used to tailor a friendly, per¬ 
sonalized, command environment. A new interactive mail processing command supports items 
such as subject and carbon copy fields, and distribution lists, and makes it convenient to deal 
with large volumes of mail. 


UNIX and unix/32v arc tradcmar 

♦vax and pdp arc trademarks of 


IfC of Poll I nkr 



Digital Equipment Corporation. 
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Better debugger support 

A version of the symbolic debugger sdb is provided which now can be used to debug 
FORTRAN 77 programs. The assembler has been rewritten and the C compiler modified to 
reduce greatly the overhead of using the symbolic debugger. 

Other software 

Also included are a number of other useful packages including the circuit analysis pro¬ 
gram SPICE, programs to simulate the phototypesetter on 200 bpi dot-matrix plotters (these pro¬ 
grams were moved from the PDP-11 to the VAX and a large number of fonts available on the 
ARPANET have been converted to the required format), a bulletin board program, routines for 
data compression, a modified version of the standard I/O library permitting simultaneous reads 
and writes, a slow-speed network for connecting heterogeneous UNIX systems at low cost (1 
tty port per connection per machine and no system changes), and a new, flexible macro package 
for nr off and troff called —me. 

Source code, binaries and machine readable versions of all documentation are included 
with the tape. We supply the magnetic tape on which the software is written. 

To receive the tape make two additional copies oi the the attached agreement, sign and 
return 2 of the 3 copies with a check for $200 U.S. payable to “Regents, University of Califor¬ 
nia,” and a copy of your UNIX/32V license agreement to: 

, Berkeley Software Distribution for UNIX 

c/o Keith Sklower 

Computer Science Division, Department of EECS 
Evans Hall 

University of California, Berkeley 
Berkeley, California 94720 

We will return a fully executed copy of the agreement to you with the distribution. 

Included with the tape will be two volumes of documentation. The first is a programmers 
manual (similar to UNIX/32V Volume 1) modified and updated to correspond accurately to the 
distributed system. The second is a volume of documents (Volume 20 similar to the two 
standard volumes (2A and 2B) describing the major packages on the tape. 

If you have questions about this tape they can be directed to Keith Sklower at the address 
above or at (415) 642-4972 or leave messages for Keith at 642-1024. 
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Comments on the performance of UNIXt on the YAXt 


William Joy 

Computer Science Division 

Electrical Engineering and Computer Science'Department 
University of California, Berkeley 
Berkeley, Ca. 94720 


Introduction 

A recent paper written by David Kashtan of SRI discussed some measurements he made com¬ 
paring the performance of the two available operating systems for the VAX: UNIX and VMS. The 
UNIX system measured by Kashtan was UNIX/22V as extended by U.C. Berkeley to support virtual 
memory. The measurements were made on version 2.1 of the Berkeley system and version 1.6 of 
VMS. This note seeks to help interpret these results and more clearly point out the differences in 
approach and current state of the two systems. 

We will show that the differences that Kashtan measured have simple explanations, and that 
the comparison of performance on such benchmarks can guide short-term tuning. Long-term deci¬ 
sions as to which system to run have been made on the basis of portability and flexibility considera¬ 
tions. The results reporte here confirm the correctness of the choice of UNIX on these grounds by 
confirming its flexibility. The detected slownesses in UNIX are neither fundamental to the way UNIX 
does business, nor difficult to mitigate if it is felt important to work on the areas i n question. We 
will discuss simple changes to the UNIX system that have been made improving performance in the 
areas mentioned. The improvements described here were incorporated in the production system at 
Berkeley during the first three weeks of March, 1980. 

System call overhead 

Let us first consider system call overhead, which underlies the measurements that were made 
of “Context Switching Performance.” From the measurements Kashtan gives for context switching 
overheads, we can first estimate some fundamental times for the VMS system: a simple system call 
(e.g. an event flag call) on VMS appears to take about 114/xsec, and a context switch about 178|xsec. 
To get an idea of scale here, the VAX memory cycle time (for cache write-through) is about 
1.2psec, the simplest procedure call jsb and matching return rsb takes about 4psec, and the high 
level language call instruction calls and matching return ret takes a minimum of 16psec, more if 
some registers are to be saved. 

On the version of UNIX Kashtan had, a trivial system call took about 350psec. This is almost 
three times as much time as VMS used. Where was tliis time being spent? The following is a rough 
breakdown: 

€> 


t UNIX is a Trademark of Bell laboratories. 

? VAX and vws are trademarks of Digital Equipment Corporation. 
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save registers, call trap 40|xsec 

setup in trap for syscall 30|xsec 

fetching arguments for syscall 60|xsec 

saving context in case interrupted lOOpsec 

system call proper 30(xsec 

checking to see if signals waiting 30psec 

recomputing process priority 60psec 

restore registers, rei _ 30|xsec 


This code is all modularly and straightforwardly written, using a small set of primitives (for 
instance the “non-Iocal-goto” at the point of an interrupted system call is effected by using part of 
the primitives performing context switching.) There are a many calls to small routines here. Each 
system call argument is fetched by calling a primitive fuword which fetches a word from user-space 
after performing access checks. The saving of the context involves a procedure call, as does check¬ 
ing for signals and recomputing priority. Altogether, when a system call has two arguments, seven 
calls instructions are performed here to various subroutines. These alone take roughly 105)j.sec, 
assuming no registers are specified to be saved in the entry mask. Since the saving and restoring of 
the registers for the call to trap takes an additional 20psec, this means that 125sec is accounted for 
without any of the work for the system call itself. Clearly, if this is to be sped up, some of these 
routine calls must be eliminated. 

With minor changes to the code for trap handling we have sped up the system call time so that 
the basic overhead is 140psec, instead of 350|xsec. This was done as follows: 

1. The trap routine’s “entry mask” as generated by the C compiler is modified at boot time to 
save all possible registers, to avoid saving some registers twice.f 

2. The routine to handle system calls was broken out from the handler for all other traps and 

called syscall. This minimizes the changes to the code for handling other traps in making the 
following improvements, and allows some small simplifications. j 

3. Fetching of all arguments can be done by a single routine copy in, instead of calling fuword for 
each argument.! Since the number of argument words to system calls is very small it is easy to 
expand the copy in primitive in line in this critical path. It can be implemented, in this special 
case, by two basic VAX instructions: a prober to determine accessibility, and a movc3 to move 
the arguments into system space.* 

4. To restore after an interrupted system call it suffices to be able to locate the frame pointer and 
stack pointer of the original syscall procedure. We can effect this context save using just three 
movi instructions. 

5. A subroutine call to check if signals are pending to this process can be avoided in almost all 
cases by first checking the mask of pending signals. This partial inline expansion of the issig 
routine reduces the overhead here by at least 16|xsec. 

6. Recomputation of the process priority at each system call can be avoided by “unloading” the 
p_pri per-process information field. The system used a single field to encode both the 
processes user-CPU usage dependent scheduling priority, and priorities related to process 
blocking during system calls. By adding a p_usrpri information field reflecting user-CPU 
usage, the code in trap reduces to an assignment of this priority to the p_pri field, instead of 
recomputing the user priority after each system call. The space overhead is one byte per 


t Previously, the assembly language for the system saved all the registers, and made no assumptions about .what 
trap itself did, resulting in registers which were used for register variables within trap being saved twice, 
t The copy of arguments into system space is important to avoid severe system security problems with system calls 
that self-modify their arguments after they have been checked for consistency. 

* The copyin primitive is complicated in the general case by the rather strange semantics of prober, which only 
checks accessibility of the first and last pages in the address range it ij given. This forces software to loop over 
each page involved in the copyin. The looping checks are not needed in the code which fetches arguments for sys¬ 
tem calls, because there are at most 16 bytes of direct arguments to a system call. 
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process, the way in which the system works is unaffected, and the code is then somewhat 
easier to understand. 

These changes reflect only local optimization of the code; the substance of system call handling 
has not been changed. It is simply the case that calls sequence used by the C compiler for pro¬ 
cedure calls is relatively expensive. Expansion of small routines and machine dependent primitives 
in critical paths is an important technique that can be used to quickly and easily mitigate routine 
call slowness when profiling or other measurements show this to be necessary. 

It should be noted that the use of calls and passing of routine parameters on the stack in the 
current VAX C compiler is different from the way VMS is coded, t VMS makes almost exclusive use of 
the jsb and rsb instructions to avoid the overhead of calls, and has stylized conventions for assign¬ 
ment of registers across assembly language routines so that pushing and popping of data on the stack 
can be avoided. This basic mechanism is a good deal faster than calls if register usage can be 
optimized carefully, but tends to make the code harder to change. 

Perspective. Our departmental VAX running 20 users in the afternoon does an average of about 100 
system calls a second. Under the old system the basic overhead for these 100 system calls was about 
3.5% of the available processor time. The faster system call interface reduces this to about 1.4%. 

The fact that the primitives for critical sections (the spl set priority level routines) were imple¬ 
mented by calls to the two instructions implementing them accounted for nearly 1/3 of the time in a 
read or write system call. Since over half of all system calls are reads or writes, a simple inline 
expansion of these primitives accounted for more improvement for reads and writes than the 
changes to the basic system call routines. 


Context switching 

The context switching tests attempted to measure how fast the systems could pass control from 
one process to another. Kashtan measured that VMS could switch between two processes at a net 
rate of 2000 switches per second using the “event flag” mechanism to signal process exchange. On 
UNIX, using the kill system call as a signalling mechanism, Kashtan found that the maximum switch¬ 
ing rate was 210 per second. He estimated that the basic switching rate of the two systems was 
5600 switches per second on VMS and 425 per second on Unix. He concluded: 

“UNIX, as currently implemented, has to do considerably more work when scheduling a 
process. In addition, UNIX must do a context switch to process number 0 in order to 
make the decision as to which process to be run next. Even this cannot explain the 
greater than 10 to 1 difference in the performance of the two systems.” 

We will see that this difference involves no great mystery, and that, in fact, UNIX can be made to 
context switch nearly as fast as VMS does simply by changing the (assembly-language) primitive sup¬ 
port of context switching to match the hardware. Remaining differences in timings are not the 
result of “inefficiencies” in UNIX, but from the close fit of the VMS strategy with the VAX architec¬ 
ture.:!: 

The VAX instruction set caters to a certain regime of context switching. To make its idea of 
context switching amenable to UNIX, it sufficed to include the C library routines setjmp and longjmp 
in the system to handle internal non-local goto’s, and to provide a new context switch primitive 
resume that corresponds to a svpctx followed by a Idpctx. 

There were two further problems with the context switching primitives: as on the PDP-11, the 
per-process system stack and control information were kept in kernel address space. It is much 
more efficient on the VAX to keep this information in the P0 or PI region where it will be remapped 
when Idpctx is used. This change was made by moving this information to the base of the stack. 


t VMS is written in assembly language. 

$ Specifically, blocked jobs are queued on linked lists with bisque (insert in queue instruction) for removal with 
remque (remove from queue instruction) while UNIX uses subroutines which hash sleeping jobs. The overhead of 

ral ltnn th#> Koclwwl onW ..4 —1_:— .t— 1 i_ _ ■ •« .. .. - ■ 
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The final problem was the one Kashtan noted; that the system switched to process 0 each time 
before switching to the eventual target for the switch. This was done only to allow the system to 
idle in process 0. This is cleaner than idling on an arbitrary process because, e.g., the process that 
called swtch might have just swapped itself out, and we would then be running on system control 
information in memory that had been released. There is actually no problem in not switching to 
process 0, since the system cannot run anything but interrupt code until we come out of the idle 
loop in switch. 

After these changes, the times for context switching improved dramatically, dropping by more 
than a factor of 5 to about 400p.sec per switch. Examination of the remaining overhead revealed 
that there were several small routines being called in critical paths in the sleep routine. This was 
simply eliminated by an inline expansion of the code. 

After the above changes had been made, we measured the system context switch time and 
broke it down as follows: 

blocking time in sleep 50 fisec 

rescheduling time in swtch 60p,sec 

resume primitive llOjxsec 

unblocking time in wakeup 50p.sec 

Thus the context switching time had been reduced to 270|xsec, a factor of seven improvement. 

In practice, there is a further efficiency issue here that is not pointed out by the benchmarks. 
The swtch routine used a search over a linked list testing to find the highest priority job on the list. 
The VMS system uses the ffs (find first set bit), insque and remque instructions and an array of run- 
queues ordered by priority to make this selection as rapid as possible. We coded this new swtch 
routine (it is about 10 lines of assembly language), and changed the system so that only truly runn¬ 
able jobs were on the run queue (the old system left runnable jobs on the run queue even when they 
were swapped out). This allows the swtch primitive to run in time independent of the length of the 
run queue. 

Perspective. Let us try to get some perspective for timesharing UNIX systems (such as the machine 
where this paper is being prepared). The system is currently supporting about 20 users, and doing 
roughly 50 context switches per second. The original code, which ran in about 2 milliseconds per 
context switch would have cost 10% of the machine in context switching. A version of the system 
changed to not switch to process 0 in swtch ran in about 800|xsec per context switch, resulting in a 
average context switching overhead of about 4% of the machine. The current system, with all the 
changes mentioned above, uses roughly 1.3% of the machine in context switching. 

If applications need absolutely fastest possible context switching time, then UNIX would have to 
be changed so that the calls to sleep and wakeup were less expensive. Roughly half of the lOOjxsec 
spent in these routines could be saved by writing them in assembly language and calling them with 
jsb instead of calls. They would then not have the 16p.sec overhead of calls and ret and could use 
registers C-5 for scratch work instead of saving and restoring registers 6-11, the registers normally 
used by the C compiler for register variables. Measurements of the incidence of sleep and wakeup 
in our environment do not justify such a change. 

IPC Mechanisms 

The measurements here were of the time for a process to send either 4 or 512 byte packets to 
another process. The system call overhead and context switch overhead reductions improved perfor¬ 
mance here, as did the inline expansion of the priority level routines. Finally, a few primitives (file 
lock and unlock, and user/kemel data movement) were defined as macros or partially expanded 
inline to save time. 

The improvements measured in 4 byte packet transmission are reported in the following table. 
The three experiments muisured the rate at which a process could send 4 bytes of data to itself, to 
send 4 bytes to another process, and send 4 bytes to another process and receive a 4 byte reply. 
The measurements give the number of 4 byte packets transferred each second. 
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Mechanism 

To self 

One-way 

Two-way 

VMS Mailboxes 

440 

297 

363 

UNIX Before 

370 

294 

281 

UNIX After 

819 

714 

600 


Perspective. The overhead in using pipes on UNIX was reduced greatly by simply eliminating 
unnecessary calls of primitives. The ultimate speed of message passing through pipes should be 
between 600 and 1000 packets per second, when no buffering is taking place. 

We see no reason that the mpx multiplexed file mechanism cannot be made to operate at 
speeds greater than that of the pipe mechanism. Most of the overhead in the pipe mechanism is in 
the block input/output system, which must be called to access and release the file system blocks of a 
pipe on eaoh read or write operation. With mpx, messages will be buffered in memory, and this 
expensive indexing can be avoided. By placing the mpx buffers in paged memory, it should be sim¬ 
ple to provide fast response when activity is heavy without tying up large amounts of core if buf¬ 
fered data accumulates. 

Paging 

Kashtan measured performance of the paging system on two kinds of paging activity: sequen¬ 
tial and random. He also tried varying, in the random case, the standard deviation of successive 
references. Never did his paging experiments involve any “memory” in the behavior of the 
processes, and they modified every referenced page. To help cope with jobs which reference or 
modify large numbers of pages rapidly, we have changed the system to read and write clusters of 
pages from the paging device. This helps to minimize the incidence of pageins and pageouts. Write 
clustering alone make nearly a factor of two improvement in the time taking to run most of 
Kashtan’s benchmarks, since he modifies every page. Read clustering improves the performance of 
jobs that sequentially access virtual memory, and has a good, but less significant impact on other 
jobs, provided it is used in moderation.* 

UNIX attempts to determine the set of pages that have been recently referenced using software 
simulation of the page reference bits that are not provided by the hardware. While experiments 
show that this works well on jobs that are typical of our working environment, jobs such as those 
run by Kashtan are not well observed by this method. A simple circular or random page replace¬ 
ment algorithm is nearly optimal for the experiments that he made. Disabling the gathering and use 
of page reference information for “memoryless” jobs, and instead relying more on simple circular or 
random replacement, more akm to the algorithm used by VMS, improves system performance also. 
We have added an advisory system call that informs the system that the process will not be well 
observed by the normal reference information gathering algorithm. This is used by LISP during gar¬ 
bage collection. One can easily imagine processes informing the system of strongly sequential or 
random paging behavior. 

The following table gives the relative times for the sequential and random benchmarks on 
Kashtan’s machine and ours. His benchmark used 8192 pages of process virtual space while ours 
used only 7500. He had 2 megabytes of real memory while we had only 1.75. These numbers are 
unfortunately not exactly comparable, and we hope to run the experiment on a 2 megabyte machine 
in the near future. 


Experiment 

VMS 

Old UNIX 

New UNIX 

Sequential access 

4:32 

20:16 

6:45 

Random access 

6:00 

17:24 

10:37 


• An excessive amount of prepaging tends to overload the page replacement algorithm by creating an artificially 
high demand for memory. We can mitigate this somewhat by not validating the pre-paged pages, forcing a “re¬ 
claim” to occur quickly if the page is not to be discarded. Observations show, however, that prepaging more than 
2048 bytes (2 of the basic 1024 byte pages that UNIX uses) at a time tends to degrade general system performance. 
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Kashtan also measured a program with random paging behavior (each successive reference was 
a random distance from the previous), with varying standard deviations of references. We report 
here the times Kashtan measured f~r VMS and UNIX when he made the experiment, and also meas¬ 
urements on our system before and after the changes mentioned above were introduced. Kashtan 
altered the basic paging strategy in VMS for these tests; the strategy used in UNIX here was always the 
one we use during general timesharing. 


Deviation 

VMS 

SRI UNIX 

Old Berk UNIX 

New Berk UNIX 

1 

0:16 

0:16 

0:23 

0:23 

10 

0:18 

0:16 

0:24 

0:24 

30 

0:25 

0:18 

0:66 

0:44 

40 

0:47 

1.08 


154 

50 

1:21 

3:54 

4:37 

238 

60 

153 

5:46 

6:21 

3:13 

80 

3:04 

6:38 

8:47 

4:53 

100 

3:27 

8:26 

10:28 

6:12 


These measurements are somewhat less than satisfactory for two reasons: the machines had 
different configurations making the results more difficult to interpret. Also, the experiments were 
too short,, and therefore the start-up time for the system to reach a staMe state has not been factored 
out of the results, f 

Perspective. The changes to the paging system to introduce clustering and for special treatment of 
processes that are not well handled by the normal techniques markedly improve the performance of 
UNIX on Kashtan s benchmarks. For the jobs we normally see on our system the improvement was 
not as noticeable. The effect of these changes on a standard synthetic workload that we had previ¬ 
ously run to evaluate system performance was negligible. 

Of far more importance to system performance was a revision ©f the swap scheduling algo¬ 
rithm. The previous version of the system used an oldest job first strategy for swapping out runn¬ 
able jobs. Changing the system to instead swap out the oldest of the n largest jobs and fiien slowing 
down the rate of swapping had dramatic effects under the heavy loads we have been encountering. 
This change is of much more importance to the typical UNIX installation than the other changes 
mentioned above. 

Conclusions 

In selecting between UNIX and VMS as an operating system for use in the arpa Image Under¬ 
standing and VLSI research communities, the UNIX system was chosen primarily out of concern for 
portability. For our purposes within the Computer Science Department at Berkeley, we have chosen 
UNIX because of its pliability to meet our needs. 

In the short term, there are areas of the UNIX system that are still suffering growing pains 
from the porting of the system to larger machines. We believe that the simplicity and modularity of 
the system, which are the keys to its portability, are also the reasons why UNIX is easy to tune, as 
this paper has demonstrated. 

We have by no means made all the changes to the system that will be needed by the arpa 
community. The throughput of the UNIX file system is more than adequate for our current time¬ 
sharing applications, but will not be great enough for large image processing applications, nor will it 
support page clustering to the file system, which will be essential if large files are to be shared 
among processes and paged from the file system. 

Changes to support such facilities have been designed, and implementation of these facilities is 

t For example, the Unix simulation of software reference bits is disabled when there is a large amount of free 
memory and a certain amount of time (15 to 20 seconds) is required after free memory drops below a threshold be¬ 
fore the reference information begins to become available. This interval is of the same length as the experiments, 
douding the results. 
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not difficult, because of the simplicity of the current system. We believe that we can, with minor 
changes to the file system structure, increase the throughput sufficiently to support most of these 
applications.t In the long term, a dirierent basic file system organization may be needed. We are 
examining other file system schemes, such as extent based file systems, confident that we can easily 
change the system to incorporate whatever scheme we decide upon. This flexibility, essential for 

S long-term viability of the system, is the reason we choose to use UNIX. 

i ; 

l' 



t We plan to initially implement a file system, based on the current inode structure, but which maintains a pool of 
8192 byte buffers as well as a pool of 1024 byte buffers. By judiciously allocating these 8192 byte contiguous 
chunks to large files, allowing programs which need access to large amounts of data to read such large blocks 
directly into their address space (without copying them in and out of the system buffer cache), we should achieve 
an significant improvement in file system throughput on applications which (for example) sequentially access large 
amounts of data. Such a clustering scheme wiil also allow the paging system to cluster the write-back of pages 
which are being shared by processes after being “mapped” from files. 
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LICENSE AGREEMENT 


THIS LICENSE AGREEMENT is made and entered into this _ day of 

_, 19_, by and between THE REGENTS OF THE UNIVERSITY OF CALI¬ 
FORNIA, a California corporation, hereinafter called "LICENSOR", and 

___ , a _ . having its principal office at 

_, hereinafter called "LICENSEE"; 


WITNESSETH: 

WHEREAS, LICENSOR owns and is the proprietor of the copyright of a certain com¬ 
puter program entitled, “Third Berkeley Software Distribution (3BSD)’’; and 

WHEREAS’, LICENSEE desires to obtain from LICENSOR, and LICENSOR desires to 
grant to LICENSEE, a license to use the aforementioned computer program; 

NOW, THEREFORE, in consideration of the mutual covenants, conditions and terms 
hereinafter set forth, and for other good and valuable consideration, LICENSOR hereby leases 
to LICENSEE the physical property described on annexed Schedule A ("Licensed Material") 

* ’subject to a non-transferrable, nonexclusive license ("License"), which is hereby granted to 
LICENSEE, to use such Licensed Material upon the terms and conditions hereinafter set forth; 
and LICENSEE hereby accepts such lease subject to the License solely upon such terms and 
conditions. 

1. Term. The term of this Agreement shall commence on the date hereof, and, unless 
sooner terminated as hereinafter set forth, shall extend indefinitely. 

2. Charges. As a fee for the use of the Licensed Material, LICENSEE shall pay LICEN¬ 
SOR a duplication charge of two hundred dollars ($200.00). LICENSEE may obtain new 
releases of the Licensed Material as LICENSOR may from time to time make available at a 
duplication charge of two hundred dollars ($200.00). Such new releases as are purchased by 
LICENSEE shall by subject to the terms and conditions of this Agreement. Such fee is due and 
payable when this License Agreement is returned, signed by the LICENSEE, and with a copy of 
the LICENSEE’S UNIX/32Vf Agreement. 

Such fee does not include local, state or federal taxes, and LICENSEE hereby agrees to 
pay all such taxes as may be imposed upon LICENSEE or LICENSOR with respect to the own¬ 
ership, leasing, licensing, rental, sale, purchase, possession or use of the Licensed Material. 

3. Maintenance and Update Services. Neither maintenance services nor update services 
are included in this Agreement. As used in the Agreement, the term "maintenance services" 
includes notice to LICENSEE of latent errors in the Licensed Material and rectification thereof. 

4. Title. LICENSEE agrees that the Licensed Material is, and shall at ail times remain, 
the property of LICENSOR. LICENSEE shall have no right, title or interest therein or thereto 
except as expressly set forth in this Agreement. However, those portions of the Licensed 
Material which are modifications of UNIX/32V and are so indicated on schedule A, are also 
governed by the LICENSEE’S agreement with Western Electric. 

5. Duplication and Disclosure. LICENSE agrees that all Licensed Material shall be held 
in confidence, that such Licensed Material is provided for the exclusive use of LICENSEE, or. 

the following CPU, namely_, Serial No. 

_ located at its__ facility, 


t UNIX is a trademark of Bell Laboratories 
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and any single replacement thereof, provided, that written notice of the replacement and its 
Serial Number is first given to LICENSOR. The LICENSEE warrants that this machine is 
licensed, by agreement with Western Electric, for using of the UNIX timesharing system, ver¬ 
sion 7 (UNIX/32V), dated _. The Licensed Material 3BSD shall not be 

duplicated, except as reasonably necessary to LICENSEE’S use of the Licensed Material under 
this Agreement or disclosed to others in whole or in part without the express written permis¬ 
sion of LICENSOR. IN PARTICULAR, LICENSEE AGREES THAT THE SOURCE FORM 
OF LICENSED MATERIAL SHALL NOT BE DISCLOSED TO OTHER LICENSEES 
WHETHER OR NOT SUCH OTHER LICENSEES HAVE CURRENT VERSIONS OF THE 
LICENSED MATERIAL. Such prohibitions on disclosure shall not apply to disclosure by 
LICENSEE to its employees and consultants if and to the extent that such disclosure is reason¬ 
ably necessary to LICENSEE’S use of the Licensed Material and provided that LICENSEE shall 
take all reasonable steps (including, but not limited to, all steps that LICENSEE takes with 
respect to information, data, and other tangible and intangible property of its own that it 
regards as confidential or proprietary) to ensure that such Licensed Material is not disclosed or 
duplicated in contravention of the provisions of the Agreement by such employees or consul¬ 
tants. 

6. Warranty and Limitation of Liability. LICENSOR MAKES NO WARRANTIES, 
EITHER EXPRESS OR IMPLIED, AS TO ANY MATTER WHATSOEVER, INCLUDING 
WITHOUT LIMITATION, THE CONDITION OF THE LICENSED MATERIAL, ITS MER¬ 
CHANTABILITY OR ITS FITNESS FOR ANY PARTICULAR PURPOSE. 

. LICENSOR shall not be liable for, and LICENSEE hereby assumes the risk of and will 
release and forever discharge LICENSOR, its agents, officers, assistants and employees thereof 
either in their individual capacities or by reason of their relationship to LICENSOR and its suc¬ 
cessors in respect to any expense, claim, liability, loss or damage (including any incidental or 
consequential damage) either direct or indirect, whether incurred, made or suffered by LICEN¬ 
SEE or by third parties, in connection with or in any way arising out of the furnishing, perfor¬ 
mance or use of the Licensed Material. In any event LICENSOR’S liability to LICENSEE on 
any ground, including but not limited to negligence, shall not exceed a sum equal to the fee 
paid to LICENSOR by the LICENSEE hereunder except as provided in paragraph 7 hereunder 
entitled "Patent and Copyright Indemnity". 

7. Patent and Copyright Indemnity. LICENSOR will defend the LICENSEE against a 
claim that a program supplied hereunder infringes a U.S. patent or copyright, LICENSOR will 
pay the resulting cost and damage awards provided that: 

a. The LICENSEE promptly notifies LICENSOR in writing of the claim; and 

b. LICENSOR has sole control of the defense and all related settlement negotia¬ 
tions. 

If such claim has occurred, or in LICENSOR’S opinion is likely to occur, the LICENSEE 
agrees to accept noninfringing replacement programs from LICENSOR, if available, or, if not, 
to return the program on written request by LICENSOR. The LICENSEE will pay only those 
charges which were payable prior to the date of such return. LICENSOR has no liability for 
any claim based upon the combination, operation or use of any program supplied hereunder 
with equipment or data not supplied by LICENSOR, or with any program other than or in addi¬ 
tion to the program supplied by LICENSOR if such claim would have been avoided by use of 
another program whether or not capable of achieving the same results, or based upon 
modification of any program supplied hereunder. 

This indemnity does not cover any material originally supplied to LICENSEE by Western 
Electric under LICENSEE’S UNIX/32V license. 

The foregoing states the entire obligation of LICENSOR with respect to infringement of 
patents and copyrights. 

8. Alterations and Modifications. LICENSEE shall make any alterations, variations, 
modifications, additions or improvements to the Licensed Material, at its own risk and expense 


for its own use and merge it into other program material to form an updated work, provided 
that, up m discontinuance of the License for such Licensed Material the Licensed Matesial sup¬ 
plied by LICENSOR will be completely removed from the updated work and dealt with under 
this Agreement as if permission to modify had never been granted. Any portion of the 
Licensed Material included in an updated work shall be used only on the designated CPU and 
shall remain subject to all other terms of this agreement. 

9. Inspection. LICENSOR shall have the right at all reasonable times to inspect the 
premises of LICENSEE subject to ail LICENSEE’S industrial security and other rules then in 
effect at LICENSEE’S premises; to determine and verify LICENSEE’S compliance with this 
Agreement. 

10. Default. If with regard to any of the Licensed Material, LICENSEE fails to pay any 
charge provided for herein within ten (10) days after written notice that the same is overdue 
and payable, or if LICENSEE with regard to any item or items of Licensed Material fails to 
observe, keep or perform any other provisions of the Agreement required to be observed, kept 
or performed by LICENSEE, LICENSOR shall have the right to exercise any one or more of 
the following remedies: 

(a) To terminate the License herein granted; 

(b) To declare the entire amount of any fee payable under Paragraph 2 hereina¬ 
bove for the entire term of this Agreement immediately due and payable as to 
any or all items of Licensed Material without notice or demand to LICENSEE; 

• (c) To sue for and recover all fees then accrued ©r thereafter accruing, with 

respect to any items of Licensed Material; 

(d) To take possession of any or all items of Licensed Material without demand or 
notice, wherever they may be located, without court order or other process of 
law. LICENSEE hereby waives any and all damages occasioned by such taking 
of possession. No taking of possession shall constitute a termination of this 
Agreement as to any item of Licensed Material unless LICENSOR expressly so 
notifies LICENSEE in writing; 

(e) To terminate this Agreement as to any or all items of Licensed Material; 

(f) In the event of any unauthorized use of the Licensed Material, including, but 
not limited to, unauthorized disclosure to third persons or use by LICENSEE 
of the material at facilities other than those identified in Paragraph 5 above, 
LICENSOR shall at its option have the right in addition to its other remedies, 
to recover from LICENSEE an amount equal to GD the sum LICENSOR would 
have charged the person or persons obtaining the benefit of such unauthorized 
use of the Licensed Material, plus (ii) any amount received by LICENSEE on 
account of such unauthorized use; 

(g) To have the obligations of LICENSEE hereunder specifically performed and to 
have any threatened or actual breach by LICENSEE enjoined, it being ack¬ 
nowledged with respect to the obligations of LICENSEE under Paragraph 5 
hereof that such equitable relief is the only adequate remedy; 

(h) To pursue any other remedy at law or in equl^. Notwithstanding any said 
repossession, or any other action which LICENSOR may take, LICENSEE 
shall be and remain liable for the full performance of all obligations on his/her 
part to be performed under this Agreement. AH such remedies are cumula¬ 
tive, and may be exercised concurrently or separately. 

11. Legal Expenses. In case legal action is taken by either party to enforce this Agree¬ 
ment, all costs and expenses, including reasonable attorney’s fees, incurred by the prevailing 
party in exercising any of its rights or remedies hereunder or in enforcing any of the terms, 
conditions, or provisions hereof shall be paid by the other party. 



12. Assignment. Without he prior written consent of the other, neither party shall (a) 
assign, transfer, pledge or hypotf: cate this Agreement, the Licensed Material or any part 
thereof or any interest therein or ( ) sublet or lend the Licensed Material or any part thereof, 
or permit the Licensed Material or any part thereof to be used by anyone except as specifically 
authorized by Paragraph 5 above. Any consent to any of the foregoing prohibited acts shall 
apply only in the given instance and shall not be deemed a consent to any subsequent like act 
nor a consent to any other act. In the event either party consents to any prohibited act 
hereunder, the other shall, without further request, apprise any third party receiving Licensed 
Material or the use thereof of the restrictions upon use contained in this Agreement. Subject 
always to the foregoing, this Agreement shall bind and inure to the benefit of the parties 
hereto, their successors and assigns. 

13. Severability. If any part, term or provision of this Agreement shall be held illegal, 
unenforceable or in conflict with any law of a federal, state or local government having jurisdic¬ 
tion over this Agreement, the validity of the remaining portions or provisions shall not be 
affected thereby. 

14. Governing Law. This Agreement shall be construed and enforced according to the 
laws of California as applied to contracts made and to be performed in California. 

15. Paragraph Headings. The headings herein are inserted for convenience only and 
shall not be construed to limit or modify the scope of any provision of this Agreement. 

16. Termination. Upon termination of the. lease herein, all Licensed Materials and 
copies thereof shall be returned to LICENSOR. 

17. Installations. Under the terms hereof, LICENSEE is entitled to only one installa¬ 
tion of Licensed Materials. Additional installations requested by LICENSEE will be made by 
LICENSOR under the terms and conditions to be separately negotiated. 

18. Entire Agreement. This Agreement contains all the agreements, representations, 
and understandings of the parties hereto and supersedes any previous understandings, commit¬ 
ments or agreements, oral or written. 


IN WITNESS WHEREOF, the parties hereto have executed this Agreement as of the day 
and year first above written. 


THE REGENTS OF THE 
UNIVERSITY 
OF CALIFORNIA 


By 


(Licensor) 


By 


(Licensee) 
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The Unix Time-Sharing System 

T. A. Dolotta 

Bell Laboratories 
Murray Hill, New Jersey 07974 


ABSTRACT 


UNIX is a trademark for a family of computer operating sys¬ 
tems developed at Bell Laboratories. Over 1,100 of these 
systems, which run on small, to large minicomputers and on 
some large computers, are used within the Bell System for 
program development, for support of telephone operations, 
for text processing, for control of laboratory experiments, for 
computer-aided design, and for general-purpose computing. 
Another 1,300 UNIX systems have been licensed outside the 
Bell System. 

The main objective of the developers of the UNIX system 
was to develop a computing environment in which they 
themselves could comfortably and effectively pursue their 
own work. The result is an operating system of unusual sim¬ 
plicity, generality, and, above all, intelligibility. A distinctive 
software style has grown upon this base. UNIX software 
works smoothly together: elaborate computing tasks are typi¬ 
cally composed from loosely-coupled, small parts that possess 
very simple, general-purpose interfaces, and that are, them¬ 
selves, very often available as “off-the-shelf’ software tools. 

This talk is an overview of UNIX. A concise, annotated 
bibliography of publications that describe UNIX is provided. 

SHARE 54 
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1. Introduction 

This note describes the routine "^backup" in the Unix 
system assembler code (m40.s or m45.s), and the respects in 
•which it is inadequate for the newer small 11s (23, 34, 60). 
It assumes you have a processor handbook for both the 11/45 
and one of the smaller processors, and a copy of John Lion's 
Commentary on the Unix System. If you don't have a Commen¬ 
tary, then the line numbering in the section describing 
backup on an 11/40 may need clarification - the entry point 
to backup in m40»s ( backup!) is line number 1012. 

2. Instruction Backup 

When a user mode segmentation exception occurs, and the 
user SP is below the stack segment, an attempt is ^ made to 
grow the stack automatically. The assembler routine backup 
(1012) is used to reconstruct the situation which existed 
prior to execution of the instruction which caused the trap, 
"grow" (4136) is used to perform the actual extension. The 
case where the trap was not due to the SP going below the 
stack segment is handled by "grow", which returns 0 if the 
SP is already within the stack segment, and a SIGSEG signal 
is indicated. 

3. M emory Management Status Registers 

SSR0 contains abort error flags, memory management 
enable plus 'some other information. If any of the abort 
flags are set, the memory management status registers are 
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frozen until the flags are cleared, to facilitate error 
recovery. . 

The SSR1 register records any aut©increment/decrement 
of the general purpose registers, including explicit refer¬ 
ences through the PC. SSR1 is cleared at the beginning of 
each instruction fetch. Whenever a general purpose register 
is either autoincremented or autodecremented, the register 
number and the amount (in 2's complement notation) by which 
the register was modified is written in SSR1. The informa¬ 
tion contained in SSR1 is necessary to accomplish an effec¬ 
tive recovery from an error resulting in an abort. The low 
order byte is written first, and it is not possible to for a 
PDP-11 instruction to autoincrement/decrement more than two 
general purpose registers per instruction before an "abort 
causing” reference. 


15 11 10 8 7 3 2 0 



Amount changed Register Amount changed Register 
(2's complement) number (2's complement) number 


Format of Status Register SSR1 


SSR2 is loaded with the 16-bit virtual address at the 
beginning of each instruction fetch, but is not updated if 
the instruction fetch fails. Upon an abort, the result of 
SSRO bits 15, 14 or 13 being set, SSR.2 will freeze until the 
SSRO abort flags are cleared. 

4. Trap Handling 

When a trap occurs, the assembler "trap” routine (0755) 
stores the contents of the memory management registers 
(0759) in case they are needed, and the memory management 
unit is reinitialised. 

The process of "backing up" and restarting a partially 
completed instruction involves: 

1. Performing the appropriate memory management tasks to 
alleviate the cause of the abort; 

2. Restoring the general purpose registers indicated in 
SSR1 to their original contents at the start of the 
instruction by subtracting the "modify value" specified 
in SSR1; - 

* 
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3. Restoring the PC to the ’’abort time” PC by loading it 
with the contents of SSR2, which contains the value of 
the virtual PC at the time the "abort generating” 
instruction was fetched. 

The routine backup relies on the ability of the 
hardware to restart a half executed instruction, e.g. the 
instruction 

mov (rO)+,-(sp) , 


may cause a trap when the attempt is made* to push the 
operand, fetched indirectly through rO, onto the stack. At 
this point, register rO has been autoincremented, and must 
be restored to its original value before the instruction is 
re-executed after the stack is grown. The memory management 
available on the larger PDP-lls (44, 45, 55, 70) automati¬ 
cally saves the amount any general purpose register was 
autoincremented/decremented (a maximum of two registers) in 
SSR1. However, this register is missing on the smaller pro¬ 
cessors (23, 34, 40, 60) and the routine "backup” has a lot 
more work to do, and may fail, as not enough information is 
saved by the processor. The classic* example is on the 
instruction 

cmp (sp), — (sp) 

In such a situation, when the source register is the same as 
the destination register, it is impossible to tell which 
half of the instruction caused the fault, and no backup is 
possible. 

It will be useful to consider Instruction backup when 
the full set of memory management registers Is available (as 
on the PDP-11/45), before considering the intricacies caused 
by the absence of SSR1. 

5. PDP-11 /45 Instruction Backup 

Instruction backup with the /45—type segmentation 
hardware Is performed by the following code:- 



•globl 

•globl 

^backup 

__backup 

__regloc 

• 

• 


0001 


mov 

2(sp),r0 

0002 


movb 

ssr+2,rl 

0003 


jsr 

pc. If 

. 0004 


movb 

ssr+3,rl 

0005 


jsr 

pc. If 

.0006 


movb 

__regloc+7,rl 

0007 


asl 

rl 



0008 


add 

r0,rl 




AUUGN 



0009 


mov 

ssr+4,(rl) 

0010 

0011 

2: 

clr 

rO 

0012 

0013 

e 

1: 

rts 

PC 

0014 


mov 

rl,-(sp) 

0015 


asr 

(sp) 

0016 


asr 

(sp) 

0017 


asr 

(sp) 

0018 


bic 

$17,rl 

0019 


movb 

__regloc(rl) ,rl 

0020 


asl 

rl 

0021 


add • 

r0,rl 

0022 


sub 

(sp)+,(rl) 

0023 


rts 

pc 


.Backup is called directly from "trap" (2812) with the 
address of the saved user registers as argument® 

0001 save the pointer to the saved registers in rO; it 

will be used more than once; 

0002 move the lower byte of the saved SSR1 (in "trap”) 

into rl; 

0003 lines 13 to 23 are a subroutine which takes the byte 
specified in rl and updates the general register 
specified therein. The lower byte of rl is of the 
. same format as one of the bytes of SSR1. First, use 
the next location on the stack as a temporary to 

hold the "amount changed" (lines 14 to 17); note the 
implicit sign extension in line 2. Isolate the 

register number (18), and create a pointer to its 

stored location (21) using the "regloc" array 

(2677). Finally, update the stored register value 
(22) and return; 

0004 do the same for the upper byte of SSR1; 

0006 reset the PC to that saved in SSR2 at the time of 

the trap; 

0010 indicate no error on return, "backup" cannot fail! 

6« Instruction Backup without SSR1 

Memory management register SSR1 is missing on the PDP- 
11 /23/34/40/60, and in prder to backup the instruction caus¬ 
ing a segmentation fault, it must be simulated. The backup 
procedure is the same as that for an 11/45, but before the 
registers are updated, the subroutine "backup" is called to 
simulate the function of the missing register, and put the 
required offsets and register numbers into the storage space 






"ssr+2" 


~ 5 - 


1013 


1047 


1050 


1053 


Save the pointer to the saved registers, and the old 
contents of r2 — it will be used as the missing 
register SSR1, and call the routine "backup" to 
simulate the action of the missing register; 

initialise "SSR1"; "bflag" is set if we are dealing 
with a byte instruction, "jflg" is an error condi¬ 
tion if it is set; . 

move the contents of the virtual PC at the time of 
the abort (SSR2) into rO, and call the routine 
"fetch" (1222) to get the instruction returned in rO 
which caused the abort* Fetch returns —1 on error 
and also clears the lower byte of r2 if an error 
occurs - this will be used later. 

the top 4 bits of the instruction are used to deter— 
mine its type" (a numerical op—code list is very 
handy). There may not be enough information to 
determine the type (0 and 10), so a further check is 
done at line 1066, on the next lower 3 bits. The 
main categories are double operand, single operand 
and illegal instruction types (illegal instructions 
in this case are those such as branch instructions 
which do not take an operand and could therefore not 
cause a segmentation exception, as well as true 
illegal instructions such as 11/45 floating point 
opcodes). Those of type "illegal" (1188) simply 
cause jflg to be set and a hasty return to 
"^backup". If a single operand instruction caused 
the fault, then the register of its operand is the 
one which caused tne fault, no others being 
involved, so call "setreg" (1196) to put any regis¬ 
ter number and amount changed into the lower byte of 
r2. 


1196 


Double operand instructions are more complicated; 
setreg is called on both the source and destination 
(1115) (note that the source information is put in 
the high byte, whereas the hardware SSR1 uses the 
low byte first - it doesn't really matter), and then 
the source operand is fetched, following through any 
indirection necessary. This has the effect of clear¬ 
ing the low byte of r2 if a fault occurs (in 
f^tch )• In this way, if the fault was caused by 
the source operand, the destination register would 
not have been altered, and so "backup" should not 
need to restore it. 

The routine "setreg" is interesting as it brings out 
one or two points about the PDP-11 instruction set. 
Displacements only occur with register modes 2, 3, 4 
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and 5 (autoincrement, * autoincrement deferred, 

autodecrement and autodecrement deferred respec¬ 
tively). With modes 3 and 5, the amount changed is 
always 2 bytes, but with modes 2 and 4, the amount 
changed can be 1 byte or 2 bytes, depending on 
whether the instruction is a byte or word instruc¬ 
tion. Also, if the register involved is the stack 
pointer (r6 or SP) or the program counter (r7 or 
PC), the amount changed is always 2 bytes; this is 
checked on lines 1206 to 1209. 

7* Special Cases 

The code in V6's m40.s described by the commentary 
works correctly on the 11/40. However, the other small 
machines have certain peculiarities which require special 
treatment. These include:— 

— FPU instructions. The 11/23+KEF11A, the 11/34-fFPl 1A, 
the 11/60 and the 11/60+FP11F. all process the 11/45 
style floating point instructions. The FPU's operands 
may be 2, 4 or 8 bytes long, and the- general registers 
are adjusted by corresponding amounts. 

- . The 1.1/40 always leaves the auto-incremented or decre¬ 

mented registers in their final state, even if a fetch 
is aborted. In some cases on the other machines, the 
registers are left in their initial state. So far as 
is known at present, these cases are:- 

1* FP11 instructions with operands of 4 or 8 bytes 
length on the 11/60 without FP11E. 

2. Instructions of the form "op (r)+" on the 11/34. 

3* Instructions of the form "op (r)+,dd" on the 
11/34, iff the source faults. 

Of course, if the registers are in their initial state, 
it is unnecessary (and incorrect) to adjust them. How¬ 
ever, "grow()" (4136) examines the sp after backup has 
run, and expands the stack iff the sp points outside 
the valid stack. If the sp is In its initial state, it 
may point to a valid stack address, in which case SIG- 
SEG will be incorrectly signalled. 

After comparing V6's m40.s, V7's m40.s, Stanford's 
m34.s and our own m34.s , we have written a new version"^of 
"backup" which is presently being tested. This supports 
FP11 Instructions, and has facilities for instruccions which 
leave the registers In their initial state.. However, it is 
not a panacea; there is one problem on the 11/34 which 

1) Which was distributed with v6+ at Newcastle. 
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appears insoluble. If the source of an 
there are four possible cases:- 

op (r)+,dd faults. 

Case Before 

• 


After Faults 

1 . 

////// 

<- r 

r -> 

iiiiii 

none 

2. 

—— — 

<- r 

r, not r-2 

r 

A 

1 

mm 




3* r -> 


<- r 

r, not r-2 


mm 




4 * r -> 


<-* r 

r and r-2 


Cases 2 and 3 are indistinguishable after the fact, and so a 
backup failure must be signalled if this situation is 
detected. The behaviour of this new backup on the /34 is 
more conservative than that of Stanford's; it fails rather 
than backup incorrectly. 


•••V 
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PROGRAMMING ANNOUNCEMENT 
New Operating System 

Because so many users have ashed for an operating system of even greater 
capability then VM, IBM announces the Virtual Universe Operating System 
OS/VU. 

Running under OS/VU, the individual user appears to have not merely a 
machine of his own, but an entire universe of his own on which he can 
set up and take down his own programs, data sets, systems networks, 
personnel, and planetary systems. He need only specify the universe he 
desires, and the OS/VU system generation program (IEHGOD) does the rest. 
This program will reside in SYS1.G0DLIB. Ihe minimum time for this 
function is 6 days of activity and 1 day of review. In conjunction with 
OS/VU, all system utilities have been replaced by one program 
(IEHPROPHET) which will reside in SYS1.MESSIAH. This program has.no 
parameters or control cards as it knows what you want to do when it is . 
executed. 

Naturally, the user must have attained a certain degree of 
sophistication in the data processing field if an efficient utilisation 
of OS/VU is to be achieved. Frequent calls to non-resident galaxies, for 
instance, can lead to unexpected delays in the execution of a job. 
Although IBM, through its wholly-owned subsidiary, the United States, is 
working on a program to upgrade the speed of light and thus reduce the 
overhead of extraterrestrial and metadimensional paging, users must be 
careful for the present to stay within the laws of physics. IBM must 
charge an additional fee for violations. 

OS/VU will run on any IBM xOxx equipped with Extended WARP Feature. 
Rental is twenty million dollars per cpu/nanosecond. 

Users should be aware that IBM plans to migrate all existing systems and 
hardware to OS/VU as soon as our engineers effect one output that is 
(conceptually) error-free'. This will give us a base to develop an even 
more powerful operating system, target date 2001, designated "Virtual 
Reality". OS/VR is planned to enable the user to migrate to a totally 
unreal universe. To aid the user in identifying the difference between 
"Virtual Reality" and "Real Reality", a file containing a linear 
arrangement of multisensory total records of successive moments of now 
will be established. It's name will be SYSl.est. 
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-Uni” -Installation Equipment List 

organisation ______ 

department ______ , _ 

address __ : __ 

city __ ' _ 

region _ . _ . 

telephone(organisation) ___ 

telephone(computer room) ___ 

telephone (dial up -line) ____ 


correspondent 
system manager 


extension 

extension 


licence(commer/academ) 
Unix versions licenced 
Unix versions run 
Other systems run 


epu type _ 
epu num _ 
memory (Kb) 


manuf 


model 


tapes 


printers 


plotters! 


displays i. 


other 


networks!. 


agent 


i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

« 

i 

i 

i 

i 

i 

i 

i 

i 


qty description 


cassetteidectape» floppy-disk!paper-tape|card-reader j 

1 i i i , 

— - 1 —. .—•—-1---i - _! 

modem!auto-dialJa/d converters!floating point!cache ! 

! * ! I I . 


I object to this information being made available to: 

Unix group members ___ 

Non group members 












Unix 


UsJl 


organisation 
department 
favoured media 

| utility 


version/description 


S status ! 



N.B. Only major or better software should be mentioned (not Bell ori¬ 
ginals!). A well known utility should be described by giving its 


source 


e.g ! Pascal ! Yrije 


Use combinations of the following characters to give software status, 
Add any others which you think appropriate. 

H - Home grown I - In development 

S - Stored here U - Used here 

W - Works! > D - Documented 

L - Licence required M - Modified here 










Uni* LUii& .I nstallation List 

Luug name ___ ' ___ 

chairman ____ . _ 

address __ 


phone 

correspondent 

address 


phone 


organisation ___ 

department __ 

representatives 

organisation __ 

department _ 

representatives _ 

organisation ____ 

department _ 

representatives _ 

organisation _ 

department _ 

representatives 

organisation __ 

department __ 

representatives . 

organisation __ 

department _ 

representatives 

organisation __ 

department __ 

representatives 

organisation __ 

department _ 

representatives _ 

organisation __ 

department _ 

representatives _ 

organisation ._ 

department ._ 

representatives _ 


*. ,a b 





